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STREPTOLYSIN O VARIANTS 



COPYRIGHT NOTICE 

A portion of the disclosure of this patent 
10 document contains material which is subject to copyright 
protection. The copyright owner does not object to the 
reproduction by anyone of the patent document or the 
patent disclosure, as it appears in the Patent and 
Trademark Office patent file or records, but otherwise 
15 reserves all copyright rights whatsoever. 

RELATED APPLTeaTTriMfl 

The application is related to United States 

20 Serial No. (Beckman Docket No. 128D-104) 

entitled "Streptolysin O Derivatives" by Craig w. Adams 
and Eva Y. Wang, and United States Serial No. 

. (Beckman Docket No. 128D-123) entitled 

"Antibodies to Streptolysin O Derivatives and Variants", 
25 by Craig w. Adams and Patty Pang. Both applications are 
being filed simultaneously herewith, and both are 
incorporated herein by reference. 



30 



FIELD OF THE INVENTION 

The present invention is generally related to 
Streptolysin O and more particularly to Streptolysin O 
variants produced by recombinant DNA technology. 



35 
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BACKGROUND OF THE INVENTION 

Disclosed herein is a variant version of the 
antigenic substance, Streptolysin O. Streptolysin O is 
5 associated in humans with, for example, rheumatic fever, 
such that immunodiagnostic assays for evidence of 
immunological response against Streptolysin O are 
routinely utilized. The disclosed variant version of 
Streptolysin O is produced by recombinant DNA techniques, 
10 is soluble upon expression, and has substantially no 
hemolytic activity. Prior to this invention, 
Streptolysin O could be obtained via the bacteria 
Streptococcus pyogenes. However, Streptolysin O, in its 
wild-type form, is a dangerous substance in that it will 
15 lyse cells, lyse neutrophils, inhibit lymphocyte 

transformation, and release histimine from plateletes. 
The toxic and pathogenic properties of Streptolysin O are 
typically monitored by the lysis of red blood cells. 

20 I. The Genetic Code 

The genetic code for a particular protein, 
such as Streptolysin O (hereinafter "SLO") , depends upon 
the sequential grouping of three nucleotides, referred to 
25 as a "codon," and the arrangement of such codons in 
relationship to each other. 

A "nucleotide" consists of a nucleoside and 
one or more phosphate groups. A "nucleoside" consists of 
30 a nitrogeneous base linked to a pentose sugar. A 
"pentose" sugar comprises five carbon atoms. In a 
molecule of deoxyribonucleic acid, or "DNA", the pentose 
sugar is "deoxyribose, " and the nitrogeneous base can be 
adenine ("A"), guanine ("G") , thymine ("T") or cytosine 
35 ("C") . In a molecule of ribonucleic acid, or "RNA", the 
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pentose sugar is "ribose", and the nitrogeneous bases are 
the same as DNA, except uracil ("U") replaces thymine. 
Three types of RNA, messenger RNA, or "mRNA" , transfer 
RNA, or "tRNA" , and ribosomal, or "rRNA" , translate the 
5 genetic information encoded in the DNA into, e.g., a 
polypeptide or a protein. Thus, genetic information is 
generally transferred as follows: DNA-»RNA-»protein . 



10 



15 



20 



25 



The sequence of the nitrogenous bases of the 
DNA molecule encodes the genetic information contained in 
that molecule. The sugar and phosphate groups of the DNA 
molecule perform a structural role, forming the backbone 
of a series of DNA molecules, referred to as a DNA 
"macromolecule." DNA consists of two complementary 
strands of nucleotide chains, and these strands are held 
together by (relatively) weak hydrogen bonds. The bases 
of each DNA molecule bind to each other: A always bonds 
with T and C always bonds with G. Thus, the sequence 
5'-ATCG-3' of a first strand lies immediately opposite a 
complementary sequence 3'-TAGC-5' on the other strand. 
This is referred to as "complementary base pairing." The 
process of complementary base pairing is referred to as 
"hybridization" and results in the formation of a stable 
double stranded DNA macromolecule. 



Each codon specifies one amino acid. "Amino 
acids" are the principal components of proteins, and 
"proteins" are the essential constituents of all living 
cells. There are 20 natural amino acids. Because there 
30 are four nucleotide bases (A, C, G and T) and three 

nucleotides per codon, there are 64 possible codons (4 3 ) . 
Accordingly, because there are only 20 natural amino 
acids, most amino acids are specified by more than one 
codon. This is referred to as "redundancy" or 
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"degeneracy" . For example, the codons GCG, GCA, GCT and 
GCC all encode for the amino acid Alanine. 



The codon ATG (Met amino acid codon) is the normal 
5 "start" codon. The codons TAA, TAG and TGA, which do not 
encode for amino acids, are normal "stop" codons. The 
formation of mRNA is established based upon the start 
codon of one strand of the double stranded DNA 
macromolecule such that the resulting single stranded 
10 mRNA will have a nucleotide sequence complementary to the 
sequence of a single strand of the DNA. When a stop 
codon is reached by the mRNA along the DNA molecule, 
translation is stopped. 

15 The regions along the DNA macromolecule which 

are translated from the mRNA are referred to as "exons" 
for eukaryotes, and "translated regions" for prokaryotes. 
"Genes" include exons (eukaryotes) and translated regions 
(prokaryotes). Thus, genes encode for proteins and/ or 

20 polypeptides. Mammals, for example, are eukaryotes ; 
bacteria, for example, are prokaryotes. 

The natural synthesis of protein takes place 
over a series of several steps. The first step is the 

25 formation of an mRNA macromolecule complementary to the 
DNA macromolecule, as noted above. Thereafter, tRNA is 
manufactured; the tRNA provides a complementary codon 
(" ant i -codon") for each codon on the mRNA macromolecule . 
Thereafter, rRNA will catalyze the assembly of the codon- 

30 specific amino acids resulting from the mRNA: tRNA into 
proteins and/ or polypeptides. 

II . Rflgomhl nant DNA Technology 

Most proteins are produced naturally in 
35 extremely small quantities. The advent of recombinant 
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DNA technology has allowed for the production of large 
quantities of proteins that were previously only 
available in such small quantities. 

5 The following describes a "typical" genetic 

manipulation as it might apply to Escherichia coli, a 
typical bacterial host used for cloning. 

In order to isolate, or "clone"/ a gene, a DNA 

10 library is constructed from a DNA sequence (referred to 
as a "genome") using vectors. A "vector" is a small 
circular molecule of double-stranded DNA that occurs 
naturally in bacteria, yeast and mammalian cells. 
Vectors generally comprise the following characteristics: 

15 (i) a DNA sequence encoding a selectable "marker" which 
assures that the vector will be maintained in an 
appropriate host cell (e.g., E. coli); (ii) a 
controllable transcriptional promoter — by 
"controllable" is meant that the promoter can be 

20 "switched on" by manipulation of, e.g. the environment of 
the vector; a "promoter" is a region of DNA sequence that 
when switched on produces large amounts of mRNA from the 
gene of interest inserted into the vector — different 
promoters (e.g., lac, trp . tac, etc.) have different 

25 rates of mRNA production; (iii) translational control 
sequences, for example, an appropriately positioned ATG 
start codon and ribosomal binding site; and (iv) a 
polylinker; a "poly linker" simplifies the insertion of 
the gene of interest in the correct orientation within 

3 0 the vector. Vectors can be engineered to provide 

restriction endonuc lease sites on either side of an ATG 
start codon located on the vector such that the gene of 
interest can be inserted next to the start codon ; this 
allows for immediate transcription of the gene upon 

35 activation of the promoter gene. 
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A "restriction endonucleasell is an enzyme 
which cuts the double-stranded DNA at specified sequences 
of four to eight nucleotides in length, and many 
restriction endonucleases produce staggered cuts that 
5 leave a short, single-stranded tail at the location of 
the cut. This end is referred to as a "cohesive" or 
"sticky" end because it can form complementary base pairs 
with another sticky end. The genome is cleaved (cut-up) 
by a specified restriction endo-nuclease corresponding to 
10 the restriction endo-nuclease used to cut the vector, and 
the individual pieces of the cleaved genome are inserted 
into the vector. Randomly cleaving the entire genome of 
a cell with a specific restriction endo-nuclease is 
typically referred to as the "shotgun" approach to gene 
15 cloning. The shotgun approach can produce an extremely 
large number of DNA fragments, all of which are inserted 
into vectors. 

The individual pieces of the genome and the 
20 vectors , having corresponding sticky ends, are "fused" or 
"annealed" together to form circular hybrid DNA 
"plasmids" comprising a portion of the genome and the 
vector. 

25 The plasmids are then introduced into host 

cells. There are two types of host cells, "eukaryotic" 
and "prokaryotic". An example of a eukaryotic host cell 
is the Chinese hamster ovary ("CHO") ; an example of a 
prokaryotic host cell is E. coli bacteria. For purposes 

30 of the discussion to follow, attention will focus on 
prokaryotic host cells. 

When the plasmids are introduced into the host 
cell, these cells are referred to as being "transformed" 
35 with the plasmids. As the cells grow and divide, the 
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plasmids will similarly replicate to produce copies of 
the plasmids containing the DNA fragments. Each 
transformed cell is referred to as a "genomic DNA clone" 
and the entire collection of transformed cells containing 
5 all of the different DNA fragments is referred to as a 
"genomic DNA library". 

In order to determine which genomic DNA clones 
contain the DNA seguence capable of being copied into a 

10 corresponding mRNA, it is necessary to separate or 

"screen" the genomic DNA clones. There are several ways 
to accomplish this task including, for example, the use 
of radioactive DNA probes or evidence of immuno- 
reactivity. Screening can be an extremely labor 

15 intensive process because, as noted, the shotgun approach 
by definition leads to the formation of an extensive 
number of genomic DNA clones, which must be screened to 
find potential candidates of interest. 

20 III. Mutations 



DNA macromolecules are chemically guite 
similar to each other. A and G are guite similar in 
chemical composition, and C, T and U are egually similar. 

25 Thus, in a specified seguence, substitutions of an A for 
a G or a C for a T may occur. When such a substitution 
occurs within a codon such that the amino acid encoded 
thereby remains the same, then the substitution can be 
referred to as a "silent" substitution, i.e., the 

30 nucleotides are different but the encoded amino acid is 
the same. However, other substitutions can alter the 
amino acid encoded by the codon; when the nucleotide 
alteration results in a chemically similar amino acid, 
this is referred to as a "conservative" alteration, while 
35 a chemically different amino acid resulting from the 
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alteration is referred to as a "non-conservative" 
alteration. Non-conservative alterations of amino acids 
can result in a molecule quite unlike the original 
protein molecule. 

5 

A protein that has had its amino acids altered 
can be referred to as a "mutant", "mutation" or 
"variant." Mutations occur naturally and can have 
positive f negative or neutral consequences on the 
10 organism experiencing such a mutation. However , such 

mutation rates are typically very low, i.e., about 10** to 
about 10' 10 mutations per base replicated. 

IV. Streptolysin O 

15 Streptolysin O ("SLO") has an approximate 

molecular weight of between about 65,000 and about 70,000 
daltons. SLO belongs to a class of oxygen sensitive 
("thiol-activated") , cell destroying ("cytolytic") toxin 
("cytotoxin") which are produced by gram-positive 
bacterial species belonging to four different genera 
(streptococcus, bacillus, Clostridium and listeria) . 



20 



SLO interacts with membrane cholesterol 
and exerts cytolytic-cytotoxic effects on a broad range 

25 of mammalian cells. Additionally, SLO has very potent 
cardio toxic properties. One of the toxic and pathogenic 
properties associated with SLO is its hemolytic activity, 
i.e. SLO will lyse red blood cells, resulting in the 
release of hemoglobin. SLO can be lethal to laboratory 

30 animals in relatively small doses. Injection of SLO into 
an animal typically results in its immediate death. 

Because SLO is produced by specified bacterial 
species, when these species "invade" a mammalian host, 
35 the SLO released by the bacteria is treated by the host 
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as a foreign protein. SLO, then, is an antigen. 
"Antigens" are high molecular weight compounds which upon 
entry into the blood stream of a vertebrate stimulate the 
transformation of the small lymphocytes of the B-type 
5 into lymphoblasts. The lymphoblasts secrete antibodies 
specific to the antigen stimulator. The antibodies are 
proteins possessing reactive sites specifically 
complementary to a reactive feature or site on the 
stimulating antigen. Antibodies generally have the 
10 property of rendering the antigen harmless to the host 
organism by occupying the immunologically active sites, 
or "epitopes" , on the antigen particles or molecules. 
Anti-SLO antibodies ("ASO") are therefore produced by the 
host in response to the secretion of SLO into the host. 
15 Approximately 80-85% of individuals with current 
streptococcal infection or their sequelae (an after 
effect of a disease or injury) will demonstrate elevated 
levels of ASO. 



20 Determination of previous and/ or current 

infection by an SLO producing bacteria (s. pyogenes) is 
possible using immunodiagnostic assaying techniques 
which, e.g., rely upon the hemolytic properties of SLO 
and the binding of ASO to SLO. Focusing on hemolytic 
25 immunodiagnostic assays for SLO, a patient sample is 
added to a known amount of SLO derived from a source 
other than the patient and this mixture is added to a 
known amount of red blood cells such as, for example, 
rabbit red blood cells. Because SLO has hemolytic 
30 properties, it will lyse these red blood cells. However, 
when ASO binds to SLO, the hemolytic properties of SLO 
are neutralized. Thus, if the sample is obtained from a 
patient having current streptococcal infection or their 
sequelae, there will be elevated levels of ASO in the 
35 sample. Accordingly, if the mixture results in high 
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levels of hemolytic activity, this indicates that there 
is little, if any, ASO in the serum sample (and hence 
little, if any, infection from the SLO secreting 
bacteria) because the known quantity of SLO in the 
5 mixture is capable of lysing the known quantity of red 
blood cells in the mixture. If the mixture does not lead 
to hemolytic activity, this is indicative of an amount of 
ASO in the sample sufficient to inactivate the known 
quantity of SLO in the mixture. Investigators refer to 
10 such an amount of ASO as a "titer". Typically, an ASO 
titer of greater than about 3 00 International Units /ml is 
indicative of infection by a bacterial source capable of 
secreting SLO. Other immunodiagniostic assays for 
determination of infection by SLO secreting bacteria 
15 include nephelometric and turbidimetric protocols. In 

order to utilize the immunodiagnostic assaying technique 
outlined above, it is necessary to have access to 
sufficient SLO to be added to the mixture. One source of 
SLO is culture broths containing the bacteria 
20 Streptococcus pyogenes ("S. pyogenes") . However, 
obtaining SLO in this manner is quite difficult and 
costly: for every liter of the S. pyogenes culture 
broth, only about 0.5mg of SLO can be expected; the 
typical media for growing S. pyogenes is expensive; S. 
25 pyogenes is a class 2 pathogen; and SLO obtained in this 
manner contains many other antigenic materials. 
Additionally, SLO obtained by this procedure tends to be 
unstable in liquid form. Accordingly, such SLO 
preparations are most typically supplied as lyophilized 
30 powder in vials. Before use, the lyophilized powder must 
be reconstituted in a suitable solvent. Unfortunately, 
such reconstituted SLO will rapidly lose its hemolytic 
activity and therefore it must be used within a brief 
period after reconstitution or discarded. This has one 
35 notable and negative consequence: it is usually 
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impossible to test individual serum samples as soon as 
they are obtained. Thus, laboratories which conduct ASO 
assays based upon hemolytic activity typically store the 
individual samples until a sufficient number are 
5 collected to enable economic use of the lyophilized SLO. 
This can result in an inordinate delay in obtaining test 
results. 

ASO assays which rely upon nephelometric or 
10 turbidimetric protocols need significant amounts of 
purified SLO. Because of the costs associated with 
obtaining significant quantities of purified SLO from S. 
pyogenes is expensive, the foregoing hemolytic based 
assay was the first ASO assay to become commercially 
15 available. 

Finally, because SLO produced by S. pyogenes 
is by definition wild-type SLO, it is hemolytically 
active, and therefore must be handled with extreme 
20 caution. 



25 



30 



Recombinant DNA techniques for obtaining SLO 
fusion products offer the benefit of obtaining relatively 
large quantities of such products. Using such 
technology, it would be possible to avoid the tedious and 
cost-ineffective aspects of obtaining SLO from S. 
pyogenes, m the co-pending application referenced 
above, SLO derivatives are disclosed. As used herein, 
the term "SLO derivative- is an SLO fusion product which 
is soluble, hemolytically active and which is recognized 
by ASO. SLO derivatives are designated herein as »rSLO» 
These SLO derivatives are provided in large quantities, 
are substantially pure, and maintain hemolytic activity. 
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It would be beneficial to obtain 'variants" of 
SLO derivatives obtained from such recombinant DNA 
techniques. As used herein, the terms "variant", 
"mutation" or "mutant" are synonymous. SLO variants are 
5 designated herein as "mSLO" . An mSLO fusion product 
would provide several benefits, most notably in the 
manufacturing of immunodiagnostic assays. This is because 
such an MSLO fusion might be created which would be 
recognized by ASO antibodies, but would not have 
10 associated therewith the hemolytic activity of wild-type 
SLO. 

SUMMARY OF THE INVENTION 

15 The present invention provides SLO variants. 

These variants, designated herein as "mSLO", comprise the 
following characteristics and are broadly defined 
thereby: (i) recognized by wild-type anti-streptolysin O 
antibodies (ASO), i.e. comprising at least one epitope 
20 characteristic of wild-type Streptolysin O; and (ii) 
substantially non-hemo lytic activity. As used herein, 
the term "recognized" means capable of being bound by at 
least one antibody directed against wild-type SLO; the 
phrase "substantially non-hemolytic activity" means an 
25 inability to lyse red blood cells at equivalent titers of 
wild-type SLO; and "wild-type SLO" is accorded the usual 
definition associated with such phrase, i.e., SLO that is 
naturally secreted by a capable bacterial source. "Wild- 
type SLO", by definition, does not include, e.g. , SLO 
30 fusion products derived via recombinant DNA techniques. 

Another aspect of the invention is directed to 
a method for making mSLO. The method comprises 
introducing a soluble, hemolytically active fragment of 
35 SLO (e.g., wild type or rSLO) in translational fused or 
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unfused form to a host capable of producing high levels 
of mutagenic activity, to obtain a mutation fragment 
candidate; introducing said mutation fragment candidate 
into an appropriate vector to obtain a plasmid; 
5 transfecting a host capable of expressing the mutation 
fragment candidate with the plasmid to obtain a mutation 
protein candidate; screening said mutation protein 
candidate for an absence of hemolysis of red blood cells. 
Preferably, screening of said candidate for solubility is 
10 also conducted. As used herein, the phrase "high levels 
of mutagenic activity means greater than about 10» to 
about io- 10 mutations per base replicated. 

Another aspect of the invention includes 
15 various kits, such as, for example, test kits for ASO 

titers, calibrator kits and control kits, a test kit for 
determining ASO titer comprises mSLO in either liquid or 
non-liquid form. Most preferably, the mSLO is in liquid 
form because this allows for ASO titer determination on a 
per-test sample basis. Calibrator and control kits 
comprise various concentrations of the mSLO such that 
calibration and control of ASO titer analysis can be 
accomplished. 



20 



25 



30 



A particularly useful mSLO in accordance with 
the present disclosure is designated herein as mSLO.3/6 
which, in a purified form, has a specifrc hemolytuc 
activity of 14 hemolytic units (»HU») per mg. 

BRIEF DESffP TPTT Qjj Qp TWB DRAWTMCfl 

Pig. 1 is the nucleic acid sequence of a most 
preferred embodiment of an SLO variant, designated 
mSLO.3/6; 



35 
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Fig. 2 is the amino acid sequence of mSLO. 3/6; 

Fig. 3 is the nucleic acid sequence of a most 
preferred embodiment of an SLO derivative, designated 
5 rSLO.3; 

Fig. 4 is the amino acid sequence of rSL0.3. 

Fig. 5 is the comparative results of a Western 
10 blot analysis of supernatant comprising 0.2/fl, 2.0/*l, 
lOjttl of rSLO.3 and mSLO.3/6, 

Fig. 6 is the result of gel electrophoresis of 
purified mSL0.3/6(A) and r SLO. 3(B); 

15 

Fig. 7 is the results of titer experiments for 
hemolytic activity of mSLO.3/6, rSl0.3 and commercially 
available SLO; and 



20 



25 



30 



35 



Fig. 8 is the result of Western blot analysis 
of rSLO.3 and mSLO.3/6 in different vectors. 

DETAILED DESCRIPTION OF 
PREFERRED EMBODIMENTS 

As used in this disclosure, Streptolysin o 
variants, or "mSLO", comprise the following 
characteristics and are broadly defined thereby: (i) 
recognized by wild-type antistreptolysin O antibodies 
(ASO) ; and (ii) substantially non-hemolytic activity. As 
used herein, the term "recognized" means capable of being 
bound by at least one antibody directed against wild-type 
SLO; the phrase "substantially non-hemolytic activity" 
means an inability to lyse red blood cells at equivalent 
titers of wild-type SLO; and "wild-type SLO" is accorded 
the usual definition associated with such phrase, i.e., 



SUBSTITUTE SHEET 



WO 93/05155 



PCT/US92/06380 



- 15 - 

SLO that is naturally secreted by a bacterial source 
capable of secreting such protein. Preferably , the mSLO 
has a percent wild-type SLO specific activity of less 
than about 1.5%, more preferably less than about 0.5%, 
5 and most preferably less than about 0.1%, based upon a 
wild-type SLO specific activity of 4xl0 5 hemolytic 
units/mg wild-type SLO. These values are relative; thus, 
if percent wild-type SLO specific activity is based upon 
a wild-type SLO specific activity of IxlO 6 hemolytic 
10 units/mg, the above values are decreased by a factor of 
2.5 (i.e., 1.5% becomes 0.6%; 0.5% becomes 0.2%; 0.1% 
becomes 0. 04%) . 

The preceding is detailed because the 
15 "specific activity" of wild-type SLO has been described 
as being as high as about IxlO 6 hemolytic units/mg, 
although specific activity of about 4xl0 5 hemolytic 
units/mg has also been described. Alouf , J. E. 
"Streptococcal Toxins (Streptolysin O, Streptolysin S, 
20 Erythrogenic Toxin) . " Pharmac. Ther. II: 661-717 (1980), 
which is incorporated herein by reference. Accordingly, 
because the reported "specific activity" of wild type SLO 
is elusive, the foregoing percentages accomodate this 
fact. 

25 

For convenience, as used herein, the term 
"vector" means a circular DNA macromolecule comprising at 
least one restriction site and at least one promoter 
gene. The term "plasmid" means a vector further 
30 comprising a portion of a genome of interest, including, 
inter alia, a gene. The term "host" means a cell capable 
of being transfected by a plasmid. 

As those in the art appreciate, most vectors 
35 are selected with respect to a desired outcome. For 
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example, in a commercial setting, high-level expression 
of the gene of interest will typically be pref erred such 
that vectors with an appropriate promoter conducive to 
such expression will be chosen; on the other hand, in a 
5 research setting, such high level expression may not be 
critical such that a vector having trans lational and 
transcriptional signals that are under the control of 
regulatory elements of the host may be appropriate. 
Accordingly, in selecting a vector appropriate for the 
10 desired outcome, it is often useful to concurrently focus 
on the promoter gene of the vector of interest. 

Promoter genes which achieve very high levels 
of mRNA production include, for example, p L , p^, and p^. 

15 This list is not intended, nor is it to be construed, as 
an exhaustive list. Rather these promoters are used as 
exemplars for purposes of the discussion to follow. 
Those in the art can readily select an appropriate vector 
having a desired promoter which can provide equivalent 

20 results vis-a-vis the listed promoters. 

For example, Pn is used in conjunction with T7 
RNA polymerase which synthesizes RNA at a rate several 
times that of E. coli RNA polymerase and which terminates 

25 transcription less frequently than E. coli RNA 

polymerase. T7 RNA polymerase is highly selective for 
initiation at its own promoter sequence; accordingly, it 
. does not initiate transcription from any sequences on E. 
coli DNA. Furthermore, T7 RNA polymerase is resistant to 

3 0 antibiotics such as rifampicin that inhibit E. coli RNA . 
polymerase. Therefore, the addition of rifampicin, for 
example, to cells that are promoting T7 RNA polymerase 
results in the exclusive expression of genes under the 
control of a T7 RNA polymerase promoter, i.e. , p^. 
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10 



Expression using the T7 RNA polymerase/p^ 
system relies upon (typically) a two-plasmid system: the 
first plasmid comprises the gene to be expressed and p^; 
the second plasmid comprises the gene for T7 RNA 
polymerase. The second plasmid, e.g. pGPl-2 (which 
comprises the gene for T7 RNA poylmerase; see Tabor and 
Richardson, Proc. Natl. Acad .sei. n.s.a. 82: 107 4- 
1078(1985)), can either permanently reside in E. coli or 
can be introduced into E. coli with a specialized phage, 
such as, e.g., an M13 vector (such as, e.g., mGPl-2, see 
Tabor and Richardson), or a X vector (such as, e.g., C E6, 
see Studier and Moffett, J.Mol.Bioi. . isg. 113-130(1986))' 
comprising the T7 RNA polymerase gene. 



15 



Typically, the second plasmid comprising the 
T7 RNA polymerase gene is under the control of a heat 
inducible E. coli promoter, i.e., by raising the 
temperature from, e.g., 30«>C to 42»c, the heat inducible 
E. coli promoter is switched on, which will in turn 
switch on the promoter of the first plasmid, thereby 
leading to the expression of, e.g., the gene of interest. 
Thus, when using a T7 RNA polymerase^ expression 
system, the E. coli system comprises a heat-inducible 
promoter, such as, for example lambda P L with a CI 857 
25 repressor. 



20 



Examples of vectors comprising p^ include, 
e.g., the pT7 series (pT7-5, pT7-6, and pT7-7, which are 
derivatives of P T7-1; see Tabor and Richardson, supra. 1 
30 and the pET series (see Studier et al., Method* R n , r m 
185:60 - 89(1990) ) . 

Another vector system comprises a p L promoter 
gene. The p L promoter is derived from the X bacteriophage 
35 and is one of the most powerful regulated E. coli 
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promoter. Transcription from p L can be fully repressed 
and therefore plasmids comprising p L can be stabilized by 
the X repressor, cl . This repressor is typically 
supplied by an E. coli host which comprises an integrated 
5 copy of a portion of the X genome. Such an E. coli host, 
referred to as an "E. coli lysogen" is characterized as 
follows: (i) it supplies the X regulatory proteins cl and 
N (an anti-termination function); and (ii) it does not 
provide lytic components that would normally lead to cell 
10 lysis. Accordingly, E. coli lysogens trans feet ed with 
plasmids comprising, e.g. , a gene of interest and p L , can 
be grown initially to high density without expression of 
the gene and subsequently induced to synthesize the 
protein under inactivation of the repressor. Examples of 
15 p L based vectors are described in, e.g., U.S. Patent No. 
4,925,799 ("pASl") , Shatzman and Rosenberg, "The pAS 
Vector System and Its Application to Heterologous Gene 
Expression in Eschericia coli . " Hep t a logy 7:305- 
355(1987), and Rosenberg et al. , "The Use of pKC30 and 
20 its Derivatives for Controlled Expression of Genes." 
Methods Enzvmol 101; 123-139(1983). 

The p^ promoter is a hybrid promoter based on 
the tac and lac promoters, de Boer, et al. "The tac 

25 promoter: A functional hybrid derived from the trp and 
lac promoters." Proc . Natl . Acad . Sci. USA 80:21-25(1983) ; 
see also, Amann, et al. "Vectors bearing a hybrid trp - lac 
promoter useful for regulated expression of cloned genes 
in Eschericia coli ." Gene 25:167-178 (1983) . Because p^ 

30 includes the lac operator region, it can be repressed by 
E. coli strains that overproduce the lac repressor, and 
be fully induced by addition of isopropyl /3-D- 
thiogaloctoside (IPTG) thereto. 
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All of the foregoing references are 
incorporated herein by reference. 



The choice of an appropriate vector /host 
5 system is within the realm of the particular needs of the 
artisan. A most preferred vector is based upon the p L 
promoter. Table I sets forth a representative (not 
exclusive) list of suitable vectors and hosts, as well as 
the sources thereof. 

10 



15 



20 



25 



30 



35 



Vector 
pBTacl DNA 

pBTac2 DNA 

PNH8A 

PNH16A 

PNH18A 

pPROK-1 

pEX2 

PUC19 

p33 
P 33 

P P L -Lamda 



TABLE I 

Host* 

JM101, JM105, 
JM107, JM109 

JM101, JM105, 
JM107, JM109 

D1210PH, D1210 

D1210PH, D1210 

D1210PH, D1210 

JM109 

N4830-1 

JM101, JM105, 
JM107, JM109 

AR120, AR58 

AR120, AR58 

N99cl + -N4830-1 



Source 
(1) 

(1) 

(2) 

(2) 

(2) 

(3) 

(3) 

(4) 

(5) 
(5) 
(6) 



* 

40 (1) 

(2) 
(3) 



InJZrLJ^Z (4) = Bethesd a Research 

BOEHRINOER MANNHEIM (5) = SMITHKLINE BECKMAN 

STRATAOBIIE CLONINO SYSTEMS NOW SMITHKLINE BEE CHAM 

CLOKETECH Laboratories , inc. (6) = Pharmacia LKB 
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For the following examples, the vectors pa33 
and pBTac2 DNA were utilized in conjunction with the host 
strains AR120 and JM105, respectively, for the subcloning 
(initially from pUC19 vector) and expression of mSLO.3/6. 

5 

EXAMPLES 

The following Examples directed to preferred 
embodiments are not intended, nor are they to be 
10 construed to be, limitations on the disclosure of the 
claims to follow. 

Example 1 

Preparation of Partially Digested 
15 Genomic streptolysin O DNA 

Genomic DNA was isolated from Streptococcus 
pyogenes (ATCC #10389) using the technique described in 
Kehoe, M. et al. Infect. Immun. , 55:3228-3232 (1987) 
20 (hereinafter "Kehoe, 198 7" J , which is incorporated herein 
by reference. Approximately Img of S. pyogenes DNA was 
obtained using this procedure (925 fig) . 

To 370/xl of S. pyogenes DNA (2.5/xg//il) was 
25 added 300/il of 10X High Salt Buffer (1.0M NaCl; lOOmM 
tris-hydroxyamino methane-chloride ("TRIS-Cl") , pH7. 5; 
lOOmM MgCl 2 ; and lOmM dithriothreotol ("DTT") ) , 2310/il of 
deionized H 2 0 and 20^1 of Bgl II (BRL, Gaithersburg, MD, 
Cat. # 5213SA) , for a final volume of 3000/Lil. This 
30 mixture was maintained at 37°C and incubated overnight. 

To this incubated mixture was added 3000^1 of 
Reagent A (250/xl phenol, 250/xl chloroform, 10/il isoamyl 
alcohol, IjUl /S-mercapthoethanol) . This mixture was 
35 agitated prior to centrifugation in order to separate the 
aqueous and the organic layer. The aqueous supernantant 
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was then precipitated with . 3M NaOAc and 95% ethanol. 
The precipitate was then redissolved in 250/xl TE (lOmM 
TRIS-C1, pH 7.5; ImM EDTA) and 25jul of 10X loading dye 
(0.2M EDTA; 50% glycerol; 0.25% xylene cyanol; 0.25% 
5 bromophenol blue) was added thereto, followed by 

electrophoresis on 1% agarose gel. The Bgl II partially 
digested S. pyogenes genomic DNA fragments were then 
evaluated according to size. 

10 As noted, SLO has an approximate molecular 

weight of 65,000 to 70,000 daltons. Each amino acid has 
an approximate molecular weight of 110 daltons, such that 
(conservatively estimating) a 70,000 dalton protein would 
be encoded by approximately 636 codons, or 1909 base 

15 pairs. Accordingly, the partially digested fragments of 
between about 2,000 to 2,500 base pairs (i.e., 2.0 to 2.5 
Kb) , as determined by the aforementioned gel 
electrophoresis method, were purified. The purified 
fragments were then resuspended in 150/Ltl of TE. For 

20 convenience, these are designated herein as "SLO 
inserts". 



Preparation of Streptolysin O 
25 Containing Plasmids 

The vector utilized was pUC19 (BRL, Cat. # 
5364SA) cut With Bam HI (BRL, Cat. # 5201SA) . 

3 0 To 1/xl of cut pUC19 vector was added 15/xl of 

the SLO inserts, 3/xl of 10X ligation buffer (660mM TRIS- 
Cl, pH 7.5; 50mM magnesium chloride; lOmM DTT; lOmM ATP). 
A final volume of 30jul was achieved by the addition of 
8/il of deionized H 2 0. To this mixture was added 2/Ltl of T4 

35 ligase (USB, Spg/pl); incubation thereof at room 

temperature proceeded overnight. For convenience, the 
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resulting material is designated as "SLO plasmid 
candidates" . 

5 Screening of SLO Plasmid Candidates 

Host cells E. coll strain JM105 were 
transformed with the SLO plasmid candidates as follows. 
A vial containing 3 00^1 of frozen JM105 competent cell 

10 was thawed, and 16.0/xl of the SLO plasmid candidates was 
added thereto. This admixture was incubated on ice for 
30min, followed by heat shock in a 37 °C water bath for 
2min. Thereafter, the transfected JM105 solution was 
added to 2ml of LB medium (lOg Bacto-tryptane ; 5g Bacto 

15 yeast extract; lOg NaCl; 1 liter deionized water; pH 7.5 
with sodium hydroxide), followed by shaking (200RPM) for 
30min at 37°C. Plating was thereafter accomplished on LB 
Ampicillin plates, followed by incubation overnight at 
37 °C; for convenience, these are designated "SLO 

2 0 tr ans f ormant s " . 

Screening was accomplished utilizing a unique 
procedure. Following overnight growth, the colonies were 
overlaid with 3ml of 2.5% washed rabbit red blood cells 

25 in 0.8% agarose in PBS/lOmM DTT, which was spread to 
cover the plates. After 40min of incubation at 37°C, 
colonies comprising SLO were surrounded by small zones of 
hemolysis. In order to confirm that. these colonies 
comprised SLO, a 25-mer oligonucleotide probe derived 

30 from nucleotides 670 through 694, inclusive, of the 

reported DNA sequence of SLO (see Kehoe, 1987) was used 
as a probe. The probe was prepared with a BioSearch 8600 
DNA synthesizer, and labelled with 32 P following the T4 
polynucleotide kinase procedure described in Maniatis et 

35 al. , Molecular Cloning , CSPL (1982}, pp. 122-126 
(hereinafter " Molecular Cloning ") . 
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The blood overlay screening technique proved 
to be an efficient and accurate method for rapidly 
screening the SLO expressed by the SLO transf ormants. 
Because a property of SLO is the ability thereof to lyse 
red blood cells, red blood cells from any source can be 
utilized, i.e., human, mouse, goat, rabbit, etc. Rabbit 
red blood cells are preferred due to the availability 
thereof. 



10 An SLO clone that led to the expression of 

protein which evidenced hemolytic activity and which 
hybridized with the 25-mer probe was designated "pUC19- 
SLO-B". For convenience, the non- vector DNA sequence 
thereof is designated herein as "rSLO-candidates" . 



15 



Example 4 

optimization of Expression and 
Determination of Solubility 



20 



In order to optimize the expression of rSLO- 
candidates, timed-digestion of rSLO-candidates using Bal- 
31 was accomplished. Additionally, and as previously 
noted, solubility of the expressed protein ab initio , 
i.e., without further chemical modification once 
25 expressed, is of import. This is because non-soluble SLO 
is by definition inactive. Accordingly, an anlysis was 
also made to determine if the exprersed protein was 
soluble, i.e. was located in a supernatant as opposed to 
a pellet, following centrifugation. 



30 



The pUC19-SL0-B was initially cut with BstE 
II (New England Bio Labs, Cat. # 162, 10 U//ul) as 
follows. To 20M1 of pUCl 9 -SLO— B (2.5/xg/ M l) was added 
40/xl of 10X High Salt Buffer, 335 M 1 deionized H 2 0, and 5/U 
35 of BstE II. This admixture was incubated at 60«c for 

2hrs, followed by extraction with 400/xl of Reagent A, and 
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precipitation with 44/il of 3M NaOAc (pH 4.8) in 888/xl of 
95% ethanol. The precipitant was then redissolved in 
40^1 H 2 0. Thereafter, 90/xl of H 2 0, 20/il of 10X Bal-31 
Buffer (120mM CaCl 2 ; 120mM MgCl 2/ 2.0M NaCl; 0.2M TRIS-C1, 
5 pH 8.0; 10mM EDTA) , and 50/il of lmg/ml Bovine Serum 
Albumin, was admixed with the redissolved precipitant. 
This was followed by the addition of lO/il of Bal-31 (New 
England Bio Labs, Cat. # 213, 100 U/ml) , for a total of 
210/il, followed by incubation at room temperature. To 

10 control the effects of Bal-31, 3 0/xl aliquots of the 210/xl 
total solution was removed at 30, 45, 60, 80, 105, 130 
and 160min post-Bal-31 addition r and these aliquots were 
each admixed in 3.3/il of 0.2M EGTA, followed by storage 
on ice. After preparation and storage of the last 

15 aliquot, all seven aliquots were pooled, extracted with 
230/Ltl of Reagent A, and precipitated with 23^1 of 3M 
NaOAc in 506/il of 95% ethanol. The precipitate was then 
redissolved in 75/il of H 2 0. 

20 A fill-in reaction followed by the addition 

of 5/il of 2.5mM dXTP, 10/xl of 10X Medium Salt Buffer 
CSOOmM NaCl; lOOmM TRIS-C1, pH 7.5; lOOmM MgCl 2 ; lOmM 
DTT) , and 10/il of lOOmM of DTT to 75/il of the redissolved 
precipitant, followed by the addition thereto of 6/xl of 

25 Klenow polymerase (5 U/m1) / and incubation at room 

temperature for 4hrs. This admixture was extracted with 
lOOjul of Reagent A, precipitation with llfxl of 3M NaOAc 
in 22/il of 100% ethanol, and resuspension of the 
precipitate in 40/xl of H 2 0. 

30 

Following the fill-in reaction, 17^1 of the 
resuspended precipitate was admixed with 3/Ltl of a linker 
comprising a Bam HI sequence (New England Bio Labs, Cat $ 
1021) and 5^1 of 5X linker ligation buffer (250mM TRIS- 
35 CI, pH 7.6; 50mM MgCl 2 ; 5mM DTT; 5mM ATP; 2.5% (w/v) PEG 
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8000 (J.T.Baker, Cat. # U222-09)). This was followed by 
the addition thereto of 2/^1 of T4 ligase (5 U/jUl) , and 
incubation thereof for 6hrs at room temperature. For 
convenience, the resulting material is referred to as 
5 "ca/ew" . 

E. coli strain JM105 was transformed with 
ca/ew as described above, followed by overnight growth as 
described above in Example 3. To determine if the 
10 plasmids comprised the Bam HI linker, to 40/xl of ca/ew 
(0.5Mg/Ml) was added 40/xl of 10X Medium Salt Buffer, and 
320/il deionized H 2 0. To this mixture was added 5(il EcoRI 
(BRL, Cat # 5202 SA, 10U/m1) , followed by incubation at 
37 °C for 2hrs. In order to ensure that the plasmid was 
15 cut, gel electrophoresis (1% agarose gel) was conducted; 
this resulted in a smear of different sizes, indicating a 
successful cut. To the cut plasmid was added 8/xl of 5M 
Naci, followed by 5/*l of Bam HI (10 U/fxl) . This mixture 
was incubated for 37°C for 2hrs. Determination of the 
20 size of the rSLO-candidate seguence subjected to Bal-31 
digestion was conducted by gel electrophoresis (1% 
agarose gel) . This resulted in a band of interest at 
about 1.2 to about 2.0 Kb which comprised rSLO- 
candidates. Thus, the initial fragments of 2.0 to 2.5 Kb 
25 which evidenced hemolytic activity had been signif icantly 
decreased in size. 

The band which comprised rSLO-candidate was 
cut from the gel and purified in 15^1 of TE such that 

30 rSLO- candidate was available for ligation in pUC19 vector 
previously cut with Bam HI and ECoRI. in order to 
accomplish such ligation, lOjul of the gel-purified rSLO - 
candidate was admixed with 4/ul of the previously prepared 
vector, 2jul of 10X ligation buffer, 2/xl of lOmM ATP, and 

35 2m1 of deionized H 2 0. To this admixture was then added 
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2/xl of T4 ligase, followed by incubation at room 
temperature for 6hrs. E, coll host cell strain JM105 was 
transformed by these plasmids as above, and active 
colonies were screened by the red-blood cell overlay 
5 method disclosed above. Active colonies were then 
selected r innoculated in LB Medium/ 10 0/ug/ml Ampicillin 
and grown overnight under the conditions described above. 

Following overnight growth, the cells were 
10 centrifuged for 5min at 8000RPM at 4°C, and the resulting 
pellet resuspended in 2ml of Reagent B (150mM KTaCl; 20mM 
TRIS, pH 7.0; ImM EDTA) . Thereafter, the resuspended 
cells were subjected to sonication for 2X 30 sec. on ice, 
followed by centrifugation at 9500RPM for 40min at 4°C 
15 using a Beckman JA2 0.1 centrifuge to obtain the expressed 
protein. 

At this stage, if the rSLO-candidate led to 
the expression of a soluble protein, that protein would 

20 be located in the supernatant. Accordingly, analysis was 
conducted for the presence of rSLO-candidate in the 
supernatant using standard Western blot protocols for 
determination of an antigenically active protein. The 
results of such Western blot analysis indicated that 

25 there was an SLO fusion product in the supernatant which 
was recognized by horse anti-SLO antibodies. One such 
fusion product was selected and designated H rSL0.3" For 
convenience, the DNA sequence leading to the expression 
of rSLO.3 is also referred to as rSL0.3. High level 

30 expression of rSL0.3 was thereafter attempted. 
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Example § 
High Level Expression of rSLO.3 

Removal of rSL0.3 from the plasmid comprising 
5 pUC19 vector was effectuated as follows. To 15/il of the 
plasmid comprising rSLO.3 (0.5/xg/Ml) was added to 40^1 of 
10X Sma I Buffer (200mM KC1; lOOmM TRIS-C1, pH 8.0; lOOmM 
MgCLj; iomM DTT) , and 345^1 of deionized H 2 0. To this 
mixture was added 5jul Sma I (BRL, Cat # 5228 SA, 10U//il) , 
10 followed by incubation at 37 °C for 2hrs. In order to 
ensure that the plasmid was cut, gel electrophoresis (1% 
agarose gel) was conducted; this resulted in a single 
band, indicating a successful cut. To the cut plasmid 
was added 8jul of 5M NaCl, followed by 5^1 of Bam HI 
15 (10U//X1). This mixture was incubated for 37 °C for 2hrs. 
To ensure that the rSL0.3 sequence was successfully cut 
from the approximately 2.7Kb pUC 19 vector, gel 
electrophoresis (1% agarose gel) was conducted. This 
resulted in two bands, one at about 2.7Kb (the vector), 
20 and the other at about l.4Kb (rSLO.3). This band was cut 
from the gel and purified in 15/ttl of deionized H 2 0 such 
that rSLO.3 was available for ligation in pA33 vector 
previously cut with Bam HI and Sma I. 

25 To 2/il of above-derived rSLO.3 DNA was added 

2nl of the above described vector, 1.5/ul of 10X ligation 
buffer, 1.5^1 of lOmM ATP, and 8/zl of deionized H 2 0. To 
this admixture was added 2nl of T4 ligase (10 U/fil) , 
followed by incubation for 5hrs at room temperature. 

30 Such incubation resulted in plasmids comprising rSL0.3 
and pA33 vector. 

E. coli strain AR120 was transformed with 
the above-described plasmids in accordance with the 
35 procedure outlined for E. coli strain JM105. Thereafter, 
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a DNA mini prep described in Current Protocols in 
Molecular Biology . Auschel, F.M. et al. , Eds, John Wiley 
& Sons (New York) (1987) , Section 1.6., followed by 
cutting the plasmids with Bam HI and Sal I (BRL, Cat. # 
5 5217 SA) to determine if the plasmids comprised rSL0.3. 
Those host cells transformed with plasmids comprising 
rSL0.3 were then subjected to induction via the nalidixic 
acid protocol. See Mott, J.E. et al "Maximizing gene 
expression from plasmid vectors containing the Xp L 
10 promoter: Strategies for overproducing transcription 

termination factor p." PNAS USA . 82 : 88-92(1985), which 
is incorporated herein by reference. As those in the art 
appreciate, nalidixic acid, which damages DNA, induces 
recA protein, a recovery protein for E. coli. A 
15 derivative benefit vis-a-vis over express ion is that recA 
has protease activity, which, inter alia , leads to 
inactivation of XcX+ repressor; this inactivation leads 
to overexpression by the p L promoter. 

20 Specifically, colonies comprising the 

transformed AR120 were lifted from the agar plates and 
innoculated in Superboth (Base - 12g tryptone, 24g yeast 
extract, 5ml glycerol, 900ml distilled H 2 0; Salt (per 
liter of base) - l.7g KH 2 P0 4 , 15. 8g K 2 HP0 4 (anhydrous) 

25 100ml distilled H 2 0) plus 100Aig/ml ampicillin at 37 °C 
until the optical density of the medium at A^ equalled 
0.4. Thereafter, nalidixic acid was added to the 
innoculated mixture at a final concentration of 60/ig/ml 
and incubated at 37 °C for 4hrs. Western blot analysis of 
30 the supernatant demonstrated the presence of rSLO.3 
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Example g 
Mutation of rSLO.3 

As those in the art appreciate, organisms are 
equxpped with elaborate mechanisms that keep DMA mutation 
rates low, from a statistical point of view. There are 
at least three steps required to prevent mutations from 
occurring during DNA replication: (i) selection of a 
nucleotide complementary to the template (i.e. the DNA 
10 macromolecule being replicated) by DNA polymerase (an 
enzyme which catalyzes the addition of a nucleotide to 
the 3' end of a DNA chain); (ii) removal of a 
noncomplementary base by an "editing nuclease" (i . 
enzyme capable of degrading, e.g., . nucleotide) ; 'a 
(xxx) correction of a mis incorporated nucleotide. 

As is further appreciated by the skilled 
artisan, "mutators" are a special class of mutations that 
render varxous genes unstable. Bacteria mutator strains 
20 for example, E. coli mutator strains, have been utilized 
to investigate the mechanisms by which organisms control 
thexr mutation rates. Accordingly, identification of 
specxfic genes within a strain of, e.g., E. coli, having 
^ mutatxons, allows for investigation of mutator activity 

The dam, mu^s, mtEi m ^ and 
xnvolved in the process of mismatch repair;— ordingly 
C011 Strains which in <^e defective versions of these 

30 c\Tmutato bit m ° derately hi9h MUtat ° r aCtivit ^ *»* = 
colx mutator genes cause specific changes in the DNA 

sequence bexng replicated. For example, the mutS3 

mutator strain of E. coli produces bi-directio^T 

transxtions and is apparently sensitive to base sequence 

35 TsZVi rr es one a:t base pair but — «* ~ 

a second A:T base pair located less than 50 nucleotides 
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away. The mutT mutator strain of E. coli is unique in 
displaying a strict specificity, i.e., only A:T - C:G 
transitions are induced. The mutD mutator strain of E. 
coli is also notable because it results in mutation 
5 frequencies of 10 3 to 10 5 times that of wild-type E. coli. 
Trans vers ions of A : T G:T, transitions of 
A:T <— - TzA, and A:T -* C:G substitutions are associated 
with the mutD strain. While the tendencies of certain 
mutator strains are known, predicting exactly when or 
10 where along the DNA macromolecule a mutation will take 
place cannot be reasonably predicted. 

In an effort to effectuate a mutation of 
rSLO.3 (having as a desired objective obtaining a mutated 

15 form thereof), the E. coli mutator strains mut D and mutT 
were utilized. A most preferred mutator strain is mutD. 
Vials containing 300/jl each of frozen competent cell mutD 
and mutT strains were thawed, and Ipl of the plasmids 
from Example 4 was added thereto. This admixture was 

20 incubated on ice for 30 min., followed by 

heat shock in a 37°C water bath for 2 rain. Thereafter, 
the transformed mutD and mutT host cells were added to 
2ml of LB medium, followed by shaking (200RPM) for 30 
min. at 37°C. Innoculation was accomplished as follows: 

25 to the incubated mixture was added 10ml of LB 

Medium/ 10 O^g/ml Ampicillin, followed by overnight growth 
at 37 °C. Innoculation was repeated 5 (five) times, 
followed by the DNA mini-prep procedure referenced above. 
This resulted in plasmids comprising rSLO.3 mutation 

30 sequence candidates. 

E. coli strain JM105 was thereafter 
transformed with these plasmids as described above, 
followed by plating and overnight growth as described 
35 above. Screening was accomplished using the unique 
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blood-overlay protocol disclosed above. Most of the 
resulting colonies were active (i.e. non-mutants) ; 
however, a few of the resulting colonies did not lead to 
lyses of the red blood cells; these are referred to 
5 herein as mSLO-candidates . The colonies comprising mSLO- 
candidates were picked for determination of the immuno- 
activity thereof. This was accomplished using standard 
Western blot protocols (horse ASO) . One of the mSLO- 
candidates which evidenced immuno-reactivity with horse 
10 ASO was selected for high level expression. 

Example 7 
High Level Expression of mSLO 

15 Having cloned a gene for mSLO, high level 

expression thereof using a p L based vector was 
accomplished. The vector was pA33; the E. coli host 
strain was AR120. 

20 Removal of mSLO from the plasmid comprising 

PUC19 vector was effectuated as follows. To 15^1 of the 
plasmid comprising mSLO (0.5/xg/Ml) was added 40/xl of 10X 
Sma I Buffer, and 345m1 of deionized H 2 0. To this mixture 
was added 5/xl Sma I, followed by incubation at 37 °C for 

25 2hrs. In order to ensure that the plasmid was cut, gel 
electrophoresis (1% agarose gel) was conducted; this 
resulted in a single band, indicating a successful cut. 
To the cut plasmid was added 8fil of 5M NaCl, followed by 
5^1 of Bam HI (10U//il). This mixture was incubated for 

30 37°C for 2hrs. To ensure that the mSLO sequence was 
successfully cut from the approximately 2.7Kb pUC 19 
vector, gel electrophoresis (1% agarose gel) was 
conducted. This resulted in two bands, one at about 
2.7Kb (the vector), and the other at about 1.4Kb. The 

35 DNA of within this band is referred to herein as 



SUBSTITUTE SHEET 



_ _ PCT/US92/06380 
WO 93/05155 

- 32 - 

"mSLO.3/6" and protein expressed by this DNA is also 
referred to as mSLO.3/6. This band was cut from the gel 
and purified in 15/xl of deionized H 2 0 such that mSLO.3/6 
was available for ligation in pA33 vector previously cut 
5 with Bam HI and Sma I. 

To 2 Ail of above-derived mSLO.3/6 was added 
2f il of the above described vector, 1.5/tl of 10X ligation 
buffer, 1.5*11 of lOmM ATP, and 8/tl of deionized H 2 0. To 
10 this admixture was added 2fil of T4 ligase (10 U/nl) , 
followed by incubation for 5hrs at room temperature. 
Such incubation resulted in plasmids comprising mSLO.3/6 
and pA33 vector. 

X5 e. coli strain AR120 was transformed by 

the above-described plasmids in accordance with the 
procedure outlined for E. coli strain JM105. Thereafter, 
a DNA mini prep as referenced above was performed, 
followed by cutting the plasmids with Bam HI and Sal I 

20 (BRL, Cat. # 5217 SA) to determine if the plasmids 

comprised mSLO.3/6. Those host cells transformed with 
plasmids comprising mSLO.3/6 were then subjected to the 
nalidixic acid and innoculation protocols described 
above. As with rSLO.3, Western blot analysis of the 

25 supernatant demonstrated the presence of mSLO.3/6. 

Example a 
specific Activity of mSLO.3/6 

30 Protein concentration and specific activity 

of non-purified mSLO.3/6 obtained from Example 7 was 
determined and compared with the protein concentration 
and specific activity of non-purified rSLO.3 immediately 
following nalidixic acid induction of each. 

35 
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Protein concentrations for both the rSLO.3 
and mSLO.3/6 crude extracts were derived using the BioRad 
Protein Assay method (Coomassie Blue G-250) . Nalidixic 
acid induced proteins mixtures were centrifuged at 
5 8000RPM for 5min at 4°C and the pellets resuspended in 
500/il sonication buffer (40mM TRIS, pH 7.5; ImM EDTA; ImM 
DTT; 200 mM MaCl) . The resuspended pellets were then 
sonicated for 2X 30sec on ice, followed by centrifugation 
at 12,000RPM for 40min at 4°C. Thereafter, 5/ul of the 

10 resuspended rSLO.3 and mSLO. 3/6 mixtures were analyzed 
for protein concentration (OD reading at A 595 ) , and the 
protein concentration of each was determined to be 
4.6/itg/Ml» Western blot analysis was conducted for each 
sample; the gel pattens for .2/xg (C,C), 2/zg (B,B'), and 

15 10/xg (A, A') of each protein (A, B, C=mSLO. 3/6; 

A ' , B ' , C ' =r SLO • 3 ; S=standard marker) were identical, as is 
evident from Figure 5. 

Specific activity comparisons were determined 
2 0 by serial dilutions of the above described crude extracts 
and addition thereto of washed rabbit red blood cells 
("RRBC") , followed by spectrophotometry reading (OD 
reading at A M ) . 5ml of fresh rabbit blood was washed 2X 
with 45ml of PBS including lOmM DTT, followed by 
25 centrifugation at 2000RPM for 5min at 4°C. Thereafter, 
1.125ml of the washed rabbit red blood celis ("RRBC") 
were drawn from the bottom of the tube and 48.875 of 
PBS/lOmM DTT was added thereto. This resulted in a 
solution comprising 2.2 5% RRBC. For the hemolytic 
30 assays, 500/xl of the 2.25% RRBC was added to 500/il of 1:2 
serially diluted rSL0.3 and mSLO.3/6 in PBS/lOmM DTT, 
followed by incubation at 37° for 30min. 



These serial dilutions were 
35 spectrophotometrically analyzed (OD readings at A^,) . 
This, analysis indicated that .2/xl of the diluted rSLO.3 
crude extract caused 50% hemolysis of the RRBC; .2^1 of 
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the diluted extract is equivalent to 2 pi of the extract 
itself. Accordingly,, the rSL0.3 crude extract evidenced 
one hemolytic unit (»HU») per two microliters, or 500 
HU/ml. The mSLO.3/6 crude extract had substantially no 
5 hemolytic activity, ive, lOOpl of the resuspended 
mSLO.3/6 culture caused less than 6% hemolysis of the 
RRBC, or .2 HU/ml. Accordingly, there is 2,700 times 
less hemolytic activity associated with mSLO.3/6 compared 
to rSLO.3. 

10 

As noted, the protein concentration of the 
crude extracts was determined to be 4.6mg/ml. 
Accordingly, the specific activity of rSL0.3 derived from 
the pA33-AR120 expression system was 108.7HU/mg, while 

15 the specific hemolytic activity of mSLO.3/6 derived from 
this system was .04HU/mg. It is noted that because these 
are values for a crude (i.e. non-purified) extract, these 
values are predicated upon total protein concentration of 
the extract. For a purified extract, the specific 

20 activity values increase. 

The DNA and amino acid sequences of mSLO.3/6 
was thereafter determined (Lark Sequencing Technologies, 
Houston TX) , and are presented in Figures 1 and 2 , 
25 respectively. In Figure 1, the first codon (ATG) was 
donated by the pA33 vector, and the second codon (GAT) 
was from the Bam HI linker. For comparative purposes, the 
DNA and amino acid sequencing of rSL0.3 are presented in 
Figures 3 and 4. 

30 

There is a single codon difference between 
the sequences of rSL0.3 and m SLO. 3/6 at codon 487. In 
rSL0.3, this codon is AAA (Lya amino acid) -while in 
mSLO.3/6, this codon is GAA(Glu amino acid). 

35 
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Example 9 
Recovery of mSLO.3/6 

The following procedure is for approximately 
5 200 grams of transformed host cells (i.e., approximately 
6 grams total protein) . 

Transfected host cells were resuspended in 
200mls of Reagent C (40mM THIS, pH 7,5; ImM EDTA; 0.1% 2- 
10 mercaptoethanol) , followed by the addition of lOOmM PMSF. 
Thereafter, the cells were disrupted by sonication, 
followed by the addition of 4ml of lOOmM PMSF. This 
admixture was centrifuged for 30min at 4°C at 15,000RPM. 

15 The resulting supernatant was removed and 

saved; 200ml of Reagent C was added to the pellet, 
followed by the addition of 4mls of lOOmM PMSF- The 
resuspended pellet was then sonicated, followed by 
centrifugation as above. The resulting supernatant was 

20 then removed and pooled with the previous supernatant, 
and the pH thereof was adjusted to 7.0 with NaOH. 

To the final volume of supernatant was slowly 
added (with stirring at room temperature) Polymin P 
25 (Aldrich Chemicals) to a final concentration of 0/75%. 
This admixture was then centrifuged for 30min at room 
temperature at 10,000 RPM, followed by retrieval of the 
supernatant. Solid sodium sulfate was slowly added with 
stirring to 80% saturation of the supernatant. 

30 

Thereafter, the admixture was stirred for 
2hrs at 4°C, followed by centrigation for 30min at 4°C at 
15,000 RPM. The pellet was then retrieved and 
resuspended in 400mls of saturated ammonium sulfate, pH 
3 5 7.0. The admixture was then centrifuged for 3 0min at 4°C 
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at 10,000 RPM, followed by retrieval of the pellet and 
resuspension thereof in 200mls in Reagent D (20mM TRIS, 
pH 7 ; ImM EDTA; 0.1% 2-mercaptoethanol) . 

5 The resuspended pellet was then dialyzed 

against 2 liters of Reagent D, with 4 changes, at 4°C. 
Sufficient room was left in the dialysis bag in that the 
volume of the sample increases. Following dialysis, the 
pH of the sample was checked, and adjusted to 7.0 with 
10 NaOH. 

The sample was then loaded onto a Pharmacia 
Fast Flow S-Sepharose column eguilibrated in Reagent D. 
A 400ml bed volume was found to be sufficient to remove 
15 the mSLO.3/6 from the sample. The flow through, 

comprising E. coli proteins, was collected and discarded, 
and the column was washed with approximately 1 liter of 
Reagent D. 

20 The mSLO.3/6 was eluted with 2x1 liter 0.0 

to 0.4M NaCl gradient in Buffer B. The fractions were 
analyzed by SDS acrylamide gel (9%) , and fractions with 
high amounts of mSLO. 3/6 were pooled. Approximately 
250ml of pooled mSLO.3/6 was recovered. 

25 

Using the above procedure, approximately 60% 
of the original total protein (i.e. approximately 0.36 
grams) was mSLO.3/6, which can be stored at 4°C until 
needed. 

30 

Example 10 
Purification of mSLO.3/6 

Purification of mSLO.3/6 was accomplished to 
35 a purity of at least 80% using the following protocol. 
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Approximately 600g of frozen cell paste 
derived in accordance with the protocol described in 
Example 9 was thawed (37*C) , resuspended in 3 liters of 
cold lysis buffer (40mM TRIS-C1, pH 7.0; 1mm EDTA; .1% 2- 

5 mercaptoethanol; 2M Nacl; 4»C) and sonicated for 60min at 
4°-io°c with a Heat Systems Ultrasonics Continuous Flow 
sonicator (Farmingdale, N.Y., No. W-385) . Thereafter, 
the material was centrifuged on a Beckman JA10 centrifuge 
at 9500RPM for 40min at 20° to 26 °C. Approximately 3 

3 liters of supernatant was retrieved. 



To the supernatant was added at 12.5% stock 
solution of Polymin P precipitant (Aldrich, Milwaukee, 
Wis.) to a final concentration of between 0.2 to 0.3%. 
15 The solution was then stirred for lhr at room temperature 
and the precipitate discarded. The pH of the liquid 
portion was then adjusted to 7.0 with NaOH. This liquid 
was then permitted to stand overnight at room 
temperature . 



20 



Thereafter, the solution was centrifuged as 
above, and a clear supernatant retrieved. The 
supernatant was then loaded onto a 1 liter phenyl- 
sepharose HIC column (Pharmacia, Piscataway, N.J.) at 
25 2ml/min. at room temperature. Thereafter, the column was 
washed with an elution buffer (20mM TRIS-C1, pH 7.0; lmM 
EDTA; 0.1% BME) at 7ml/min. Fractions were monitored by 
SDS-PAGE electrophoresis using the Pharmacia Phast-Page™ 
system. Protein concentration was determined with thsi 
3 0 BioRad Protein Assay Kit. Fractions containing protein 
were then pooled. 

The pooled fractions was then loaded onto a 1 
liter Blue Affinity Column (BioRad, Richmond, California) 
35 at 2ml/mm at room temperature, followed by washing using 
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the elution buffer described above at 2ml/min. at room 
temperature for two column volumes, 

Elution of bound protein was accomplished 
using an NaCL density gradient of 0.0 to 0.8M, pH 7.0. 
Fractions were monitored with the Phast-PAGE System and 
protein concentration determined with the BioRad Protein 
Assay Kit. A single peak was obtained at 0.3-0.4M on the 
NaCL density gradient. 



10 



Purity of the eluted mSLO.3/6 was evaluated 
using a Beckman DU 7500 spectrophotometer, based upon 
analysis of major band homogeneity obtained from gel 
electrophoresis (12% SDS-polyacrylamide) of six different 
15 amounts of the eluted mSLO.3/6 (16, 8, 4, 2, 1, .5^g 

mSLO.3/6) . The evaluated purity of mSLO.3/6 (and rSLO.3 
for comparative purposes) based upon major band 
homogeneity is set forth in Table 2: 



20 Table 2 

mSLO.3/6 and rS LO.3 (ua) Percent Homoaeneity 

mSLO. 3/6 rSLO.3 

25 

0.5 >99.0% >99.0% 

1.0 >99.0% >99.0% 

2.0 >99.0% 94.4% 

4.0 86.8% 82.4% 

30 8.0 85.1% 81.4% 

16.0 81.9% 80.1% 



The gel electrophoresis results are presented in Figure 6 
35 where A is mSLO.3/6 and B is rSLO.3. 



WO 93/05155 



PCT/US92/06380 



- 39 - 

Example 11 

Specif c Hemolytic Activity (Titer) 
of Purified mSLO.3/6 

5 

Figure 7 provides a photograph of a 96-well 
microtiter plate. The microtiter wells evidence the 
results of an analysis of the hemolytic activity (by 
titer) of rSLO.3; a commercially available wild-type 
10 version of SLO (Sigma, Product No. S-5265) ; and mSLO.3/6. 

For the hemolytic assay titers, 0.154g DTT was 
added to 1 titer of PBS. Fresh rabbit red blood cells 
(2.5%) were washed with PBS, followed by centrifugation 
15 at 2000RPM for 20min at 4°C; this protocol was followed 
three times. Initial dilution of samples was 1:50 in the 
aforementioned PBS -DTT solution based upon 500/il PBS-DTT 
solution to IOmI of the sample; a 1:2 dilution series was 
followed for 12 dilutions. 

20 

Referencing Figure 7, in well Al, 200^1 of 
diluted rSLO.3 was added thereto; in Bl, 200jul of diluted 
mSLO.3/6; in CI, 200/xl of diluted Sigma SLO. In the 
remaining wells, 100/il of PBS-DTT solution was added 

25 thereto. Thereafter, 10 O/il of the solution from Al was 
pipetted into A2; this procedure was repeated such that 
each well had 100/ul of solution mixture. Thereafter, 
using an 8 channel pipettor, 100/xl of the washed rabbit 
red blood cells were added to each well, followed by 

3 0 gently shaking and incubatim at room temperature for 2 0 
minutes. Thereafter, the plate was centrifuged at 500RPM 
in a Beckman J4.2 centrifuge for 5min at 4°C. 

Dilutions were as follows as represented in 
35 Table 3 for the microtiter wells shown in Figure 7: 
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Table 3 



5 



15 



25 



12 


.24 


6.0 


X 


io- 5 


11 


.49 


1.2 


X 


ID" 4 


10 


1.0 


2.4 


X 


io-» 


9 


2.0 


4.8 


X 




8 


3.9 


9.5 


X 


io- 4 


7 


7.8 


1.9 


X 


IO" 3 


6 


15.6 


3.8 


X 


io- 3 


5 


31.3 


7.6 


X 


io- 3 


4 


62.5 


.015 




3 


125 


.03 




2 


250 


.06 




1 


500 


.125 





A A' 

30 Units : Ml/ml 



In Figure 7, A, A' is rSL0.3, B,B' is mSLO.3/6, 
C,C is Sigma SLO, and D is a negative control (no SLO) . 
Dilutions for B,B, C,C' and D were identical to A,A' except 
35 that Sigma SLO was not diluted beyond 1.0 X 10° pi /ml and 
the negative control did not extend past D6. 

Plates were read by determining which well in the 
dilution range did not evidence lysis of the red blood 
40 cells. If lysis occurs, then the well is red; the lack of 
lysis is indicated by the pellet of red blood cells located 
at the bottom of each well (i.e. the "center" of each 
well) . 
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As is evident from Figure 7, no lysis occurred at 
any titer for mSLO.3/6, while lysis occurred at 
approximately the same titer for both rSLO.3 and Sigma SLO. 
Accordingly, mSLO.3/6 evidences substantially no hemolytic 
5 activity at any equivalent titer of either the commercially 
available SLO or rSL0.3. 

For determination of hemolytic activity , the 
concentration of purified mSLO.3/6 was determined 

10 (1.2mg/ml). A 1:16.7 titer of mSLO.3/6 was required to 
obtain greater than 50% lysis of 2.5% RRBC. Accordingly, 
the specific hemolytic activity of purified mSLO.3/6 is 
14HU/mg (16.7 -s- 1.2). For comparative purposes, the 
hemolytic activity of rSL0.3 was similarly determined. A 

15 l:25 f 600 titer of a .7mg/ml concentration of purified 
rSL0.3 was required to obtain greater than 50% lysis of 
2.5% RRBC. Accordingly, the specific hemolytic activity of 
purified rSLO.3 is 3.6 x 10 4 HU/mg. This evidences a 2,600 
fold difference in specific activity which correlates with 
20 the 2,700 fold difference obtained from the crude (non- 
purified) extracts of Example 8. 

Example 12 
Subcloning of mSLO.3/6 

25 

Having obtained, verified and sequenced mSLO.3/6, 
subcloning and expression thereof using another 
expression/ vector system was initiated. The vector, pBTac 
2 DNA (Boehringer Mannheim, Cat. No. 1081381, 10/xg) was cut 

3 0 with Hind III (BRL, Cat. No. 52 075A, 10 U/ml) by admixing 
30/xl of pBTac2 DNA (l/xg//xl) , 30/Ltl of 1 OX Medium Salt 
Buffer, 240/xl deionized H 2 0, followed by addition thereto of 
5/il of Hind III (BRL, Cat. # 5207 SA, 10U//il) . This 
admixture was incubated for 2hrs at 37 °C. Thereafter, the 

3 5 admixture was analyzed by agarose electrophoresis (1% 
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agarose gel) to determine if the vector had been 
successfully cut; a single band indicated that the cut had 
been successful. 

5 To the 3 05/11 admixture was added 300/xl of Reagent 

A. This admixture was then centrifuged for 5min at 
12,000RPM on a Beckman microcentrifuge , followed by 
retrieval of the upper liquid layer. To this liquid layer 
was added 33/il of 3K NaOAc (pH 4.8) and 660/il of ethanol, 
10 followed by precipitation overnight at -20°C. This was 
followed by centrifugation for lOmin at 12,000RPM on a 
Beckman microcentrifuge. The pellet was retrieved and 
dried by air. The dried pellet was then resuspended in 
150m1 of deionized water. 

15 

In order to blunt (fill-in) one end of the Hind 
III cut vector, the ISOjul solution comprising the 
resuspended pellet was admixed with 10/xl of 20X dNTP 
(2.5mM) , 20/il of 10X MSB and 20/il of lOOmM DTT. This was 

20 followed by the addition of 4pl of Klenow polymerase (New 
England Biolabs, Cat. No. 210, 5 U/ml) and incubation at 
room temperature for 7hrs. Thereafter, 300^1 of Reagent A 
was added to the incubated mixture, followed by 
centrifugation for 5min at 12,000RPM. The upper liquid 

25 layer was retrieved and precipitated as above. The dried 
pellet was then resuspended in 30/xl of deionized H 2 0. For 
convenience, the filled-in, Hind III cut vector is referred 
to as "vec.rb" . 

30 Thereafter, vec.rb was cut with Bam HI (BRL, Cat 

No. 5201 SA, 10 U/Mlj • To 3 Op! of vec.rb was added 30^1 of 
10X High Salt Buffer and 240^1 of deionized H 2 0. To this 
admixture was added 5/il of Bam HI, followed by incubation 
for 2hrs at 37 °C, To the incubated mixture was added 300^1 

35 of Reagent A^ followed by centrifugation as above. The 
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upper liquid layer was retrieved and precipitated as above. 
The dried pellet was then resuspended in 2 0/ul of deionized 
H 2 0. The resuspended pellet comprised Hind III cut, filled- 
in, Bam HI cut pBTac2 DNA. 

5 

The mSLO.3/6 removed from the plasmid described 
above as follows. To 40/il of the plasmid comprising 
mSLO.3/6 (l/xg/l/xl) was added 10X Smal Buffer, and 320^1 
deionized H 2 0. To this mixture was added 5/xl Sma I 

10 (10U//il), followed by incubation at 37°C for 2hrs. In 
order to ensure that the plasmid was cut, gel 
electrophoresis (1% agarose gel) was conducted; this 
resulted in a single band, indicating a successful cut. To 
the cut plasmid was added 8m1 of 5M NaCI, followed by 5/xl 

15 of Bam HI (10 U/^l) . This mixture was incubated for 37 °c 
for 2hrs. To ensure that the mSLO.3/6 sequence was 
successfully cut from the approximately 6.3Kb pA33 vector, 
gel electrophoresis (1% agarose gel) was conducted. This 
resulted in two bands, one at about 6.3Kb (the vector), and 

20 the other at about 1.4kB (mSLO.3/6). The 1.4Kb band was 
cut from the gel and purified in 20/xl of deionized H 2 0 such 
that mSLO.3/6 was available for ligation in the prepared 
pBTac2 vector. 

25 To 3/*l of the vector was added 2fil of mSLO. 3/6, 

l.Spl of 10X Ligation Buffer (0.66M TRIS-Cl (pH 7.5), 50mM 
MgCl 2 , 50mM DTT, lOmM ATP), 1.5/xl of lOmM ATP and 7^1 of 
deionized H 2 0. Thereafter, 1.5/xl of T4 Ligase was added 
thereto, followed by incubation overnight at room 

30 temperature. For convenience, this mixture is referred to 
as the "subclone^ 1 . 

E. coli strain JM105 was transfected with 
subclone m as follows. A vial containing 3 00/xl of frozen 
35 JM105 competent cell was thawed, and 8.0^1 of subclone m was 



SUBSTITUTE SHEET 



WO 93/05155 



PCT/US92/06380 



- 44 - 

added thereto. This admixture was incubated on ice for 
30min, followed by heat shock in a 37 °C water bath for 
2min. Thereafter, the trans fected JM105 solution was added 
to 2ml of LB medium (log Bacto-trypitane ; 5g Bacto yeast 
5 extract; lOg NaCl; 11 deionized water; pH 7.5 with sodium 
hydroxide) , followed by shaking (200RPM) for 3 0min at 37°C. 
Plating was thereafter accomplished on LB Ampicillin 
plates, followed by incubation overnight at 37 °C. 

10 Because the resulting colonies comprised 

mSLO.3/6, the blood overlay screening protocol could not be 
utilized (i.e. , a non- viable colony could also evidence 
non-hemolytic activity) . Accordingly , screening was 
accomplished using hybridization probes obtained from 

15 pUC19-SLO-B cut with BstE II and EcoRV (approximately a 
1.1Kb probe), labelled by the random primer labelling 
method described in Anal. Biochem. , 132 6-13(1983). 

In order to screen these colonies , pure 

20 nitrocellulose transfer membranes (Schleicher & Schuell, 
Keene, NH, No. 20440; pore size: 0.45jLtm) were placed onto 
the plates, and colonies were lifted therefrom. The 
membranes were subjected to a denaturing buffer (0.5M NaOH; 
1.5M NaCl) for 5min at room temperature. This was followed 

25 by addition of a neutralizing buffer (1.5M NaCl; 1M TRIS , 
pH 8.0) . The membranes were then baked for 2hrs at 80 °C, 
followed by addition thereto of 12ml of hybridization 
buffer (5X SSPE; 50% deionized formamide; 5X Denhardts ; 
100/ig/ml denatured salmon testes DNA - see Molecular 

30 Cloning ) and prehybidized for Ihr at 37 °C. Thereafter, the 
above-described probes were added therto, and hybridized 
overnight at 37 °C. This was followed by three washings 
(lOmin each) with lOmM KP0 4 , pH 7.0, ImM EDTA. The 
membranes were then exposed to x-ray film overnight at - 

35 70 °C. Areas indicative of successful hybridization were 



SUBSTITUTE SHEET 



WO 93/05155 PCT/US92/06380 

- 45 - 

then used to obtain colonies comprising the mSLO.3/6 
subclones. 

Screened colonies comprising mSLO.3/6 subclones 
5 were innoculted in 12ml of Superbroth-ampicillin broth. 
Induction was accomplished by the addition of isoprpyl-0-D- 
thiogalactopyranoside ("IPTG"), at a final concentration of 
lmM, to the culture broth when the culture broth had an OD, 
reading of 0.7. 12ml of the resulting solution was 
10 centrifuged at 8000RPM for lOmin at 4°C and the resulting 
pellet resuspended in 1.2ml of PBS/lOmM DTT. The 
resuspended pellet was sonicated for l.Smin; the protein 
concentration of the sonicated extract was determined using 
the BioRad Protein Assay protocol described above. For the 
15 extract comprising mSLO.3/6, the protein concentration was 
I3mg/ml. This data was used to determine the specific 
hemolytic activity of the sonicated extract by titer based 
upon the 50% lyses of 2.5% washed rabbit red blood cell 
protocol described above. 



600 



20 



For purposes of comparing the specific hemolytic 
activity of mSLO.3/6 with that of rSL0.3 using the P BTac2 - 
JM105 expression system, rSLO.3 was also obtained using 
this expression system following the specific procedure 
25 described above (derived protein concentration of extract 
comprising rSL0.3 - 9.3mg/ml). Results are provided in 
Table 3: 
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Sonicated Extract 
Leal 



Table 3 

>S0% Lysis (+) 
<50% Lvsis (-) 



Hemolytic Units 
per mq extract 



10 



15 



20 



260. 
520. 

mSLO.3/6 1040. 

2080 



rSL0.3 



0.186 
0.372 
0.744 
1.116 
4.650 



(") 
(") 
(~) 
(-) 

(-) 
(+) 
(+) 
(+) 
(+) 



4.8 X 10 



2.69 x 10 3 



* - Estimated based upon 25% lysis at endpoint which 
contains 2080/ig extract per ml of buffer 



The foregoing data indicates that for the 
pBTac2 - JM105 expression system, the crude extract 
25 comprising mSLO.3/6 is approximately 5,600 times less 
hemolytically active than the crude extract comprising 
rSL0.3. 



30 



Example 12 

Comparative Analysis of Expression Systems 



In an effort to compare the relative expression 
sytems vis-a-vis the expression of rSL0.3 and mSLO.3/6, 
Western blot analysis was conducted on the pellets and 
35 supernatants for the pA33 vector comprsing rSLO.3 and 

pBTac2 vectors comprising rSLO.3 and mSLO.3/6. Identical 
protocols were followed for side-by-side comparative 
studies; each transformed host was innoculated in 
Superbroth, and respective induction strategies were used, 
i.e., naladixic acid for pA33, and IPTG for pBTac2. Figure 



40 
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9 provides the results of the Western blots of the crude 
extracts from the various materials indicated (S=standard 
marker; A=pA33-rSLO. 3 pellet; A'=pA33-rSLO. 3 supernatant; 
B=pBTac2 -mSLO .3/6 pellet ; B ' =pBTac2 -mSLO .3/6 supernatant ; 
5 C=pBTac2-rSL0.3 pellet; C'=pBTac2-rSLO. 3 supernatant.) 

As is evident from Figure 9 itself, the pA33 
expression system appears to differ from the pBTac2 system 
in that the bands for SLO-material , while appearing in 

10 approximately the same horizontal lane as with the pBTac2 
system, appear to evidence a greater amount of SLO in the 
pellet compared to the pBTac2 system. In an effort to 
confirm these trends, the results of Figure 9 were analyzed 
by the aforemention Beckman DU 7000 spectrophotometer, and 

15 the resulting peaks for SLO-material (as a function of 
total peak area for both pellet and supernatant) for each 
system were compared. These results are summarized below 
in Table 4; 



25 



30 



20 TABLE 4 



pA3 3 -r SLO . 3 pBTac2 -r SLO . 3 pBTac2 -mSLO .3/6 

S P S P S P 



9.23 8.14 13.71 7.01 12.44 7.22 



B 53 47 66 34 63 37 



S - Supernantant 
P - Pellet 

A - Specific peak area relative to total peak areas for S 
lane and P lane 
35 B - Percentage of (S + P) 

These results indicate that for the pA33 system, 
more SLO is in the pellet compared to the pBTac2 system; 
40 nearly twice as much SLO material is in the pBTac2 

supernatant than in the pellet, while for the pA33 system, 
approximately the same amount of SLO material is in the 
supernatant and the pellet. 
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Example i* 

Analysis of Specific Hemolytic Activity 
of Purified mSLO.3/6 and r/SLO.3 

5 As noted, the specific activity and percent 

hemolytic activity of specific versions of rSLO and mSLO 
(purifed rSLO.3 and mSLO.3/6 respectively), based upon the 
"specific activity" of wild-type SLO, are as follows in 
Table 4: 



25 





TABLE 5 








Wild-Type 
SLO 


rSL0.3 


mSL0.3/6 


Specific Activity 
(Hemolytic Activity 
in Hemolytic Units/mg) 


a) lxlO 6 

b) lxlO 5 


3 . 6xl0 4 


14 


Percent Hemolytic 
Activity of Wild-Type 
SLO 


a) 100 

b) 100 


3.6 

9 


1.4x10* 
3 . 5X1 0- 3 



The foregoing indicates that mSLO.3/6 is about 
1.4xl0' 2 % less hemolytically active than wild-type SLO. 
I.e. r mSLO .3/6 has less than 1% of the hemolytic activity 
30 of wild- type SLO. 

Example 

In Vivo Toxicity Effects of mSLO.3/6 

35 In order to evaluate in vivo toxicity effects of 

mSLO.3/6, Balb/c mice were administered undiluted and 
diluted intravenous injections of mSLO. 3/6. Undiluted and 
diluted control suspension buffer was administered to an 
equivalent number of mice. To improve the intravenous 

40 injections r the mice were warmed under a heat lamp for 20- 
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30 minutes of pre-injection. Approximately 20 mice were 
used for each condition. 



For the undiluted mSLO.3/6, each mouse received 
5 an approximate dosage of 17 mg/kg, while for the diluted 
mSLO.3/6, each mouse received an approximate dosage of 1000 
Mg/kg. Control solution buffer did not affect the control 
mice. 



10 Aside from minor ruffling for several minutes 

after injection, none of the mice receiving either diluted 
or undiluted mSLO.3/6 showed any ill effects from the 
intravenous administrations. This data supports the 
specific activity data for purified mSLO.3/6, i.e., the 
15 non-hemolytic activity of mSLO.3/6 is evident in that the 
mice receiving both diluted or undiluted doses thereof did 
not expire or otherwise evidence any serious side effects. 

The foregoing Examples are directed to the 
20 generation of an SLO genomic library. As those in the art 
appreciate, another type of library which is much less 
complex than a genomic DNA library is a "complementary 
DNA", or "cDNA", library. cDNA is derived directly from 
mRNA; therefore, by definition, the cDNA library is 
25 comprised of regions of translation. Methods for deriving 
cDNA libraries based upon mRNA complementary to mSLO DNA 
are considered to be within the purview of the skilled 
artisan such that cDNA-based libraries for mSLO are a part 
of this disclosure. 

30 

PROTEIN FOLDING 



As the linear arrangement of nucleotides defines 
a specific codon, the arrangement, or sequence, of amino 
35 acids defines the protein, including the particular 
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* function thereof. However, while the particular amino acid 
sequence is important with respect to the identity of the 
protein, the particular three dimensional shape that the 
protein exhibits is of similar import. Such specificity in 
5 terms of shape, in essence, co-defines the properties of 
the protein because the shape of the protein enables the 
protein to specifically interact with other molecules that 
will only recognize that particular protein shape. 

10 Most proteins spontaneously fold into their 

correct shape as they are translated within the cell. By 
treating the protein with certain denaturing solvents , the 
protein can "unfold" into a flexible chain. When the 
denaturing agent is removed, portions of the the flexible 
15 chain may refold into their original conformation. This is 
because one of the most important factors governing the 
folding of a protein is the distribution of polar 
(hydrophilic, or "water-hating") and non-polar 
(hydrophobic, or "water-loving") side chains of the amino 
20 acids of that protein. Denaturing solvents interfere with 
the polarity of the amino acid side chains. The following 
amino acids have polar side chains; Asn; Gin; Ser; Thr; 
and Tyr. The following amino acids have non-polar side 
chains: Giy; Ala; Val; Leu; Iso; Pro; Phe; Met; Trp; and 
25 Cys. Amino acids with basic and acidic side chains are 
very polar. The following amino acids have basis side 
chains: Lys; Arg; and His. The following amino acids 
have acidic side chains: Asp and Glu. 

30 The environment in which proteins naturally exist 

is, by definition, a non-denaturing environment, which is 
most typically aqueous. Accordingly, the hydrophobic side 
chains of a protein tend to be pushed together in the 
interior of the protein molecule, which enables these to 

35 avoid contact with the aqueous environment. Polar side 
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chains, on the other hand, tend to arrange themselves near 
the outside of the protein molecule, where they can 
interact with water and other polar molecules. 

5 While the molecular mechanisms by which a linear 

DNA sequence is transcribed and translated into a precise 
amino acid sequence of the corresponding polypeptide is 
well understood, exactly how the polypeptide chain folds 
simultaneously and autonomously into its three-dimensional 
10 structure is not clearly understood. The real potential of 
synthetic DNA, i.e. DNA synthesized via recombinant 
techniques, will be realized in the area of 

of protein design. In order for this to be realized, 
however, the mechanism of protein folding will have to be 
15 more succinctly clarified. While the general problem of 
predicting protein structure from the sequence is elusive 
(principally because no rules have emerged that allow 
structure to be related to sequence) , it is clear that 
certain portions of the sequence are important to the 
structure and other portions are relatively unimportant 
from a structural point of view such that substitutions or 
modifications can be made at these portions. Accordingly, 
it is assumed that portions of the sequence of a protein ' 
contribute significantly to the stability of the folded 
25 protein structure. 



20 



While predicting a protein structure from the 
protein sequence is elusive, proteins, by definition, have 
unique three-dimensional structures which can be 
30 determined. The following methodologies, for example 
be used in the determination of protein structure: 
Crystallography; Optical Activity, Nuclear Magnetic 
Resonance Spectroscopy. 



can 
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a) crvstollaqraphv 

Proteins are capable of forming crystals. 
Proteins usually crystallize in a condition of saturation 
5 or super- saturation which can be achieved by altering one 
or more of a number of variables that affect the solubility 
of the proteins. Thus, by altering the ionic strength of 
the solution or by utilization of organic polymers, e.g., 
polyethylene glycol, proteins can be crystallized. 

10 Techniques for growing protein crystals are set forth in 
Narang, S.A. Protein Engineering: Approaches to the 
Manipulation of Protein Folding (Butterworth, Publisher, 
Stoneham MA. , 1990), Chpt. 6 (hereinafter "Narang"). The 
preceding text book is incorporated herein by reference in 

15 its entirety. Having crystallized the protein, the 

techniques of x-ray, neutron, and electron diffraction can 
be used to determine to structure of the protein, with x- 
ray diffraction being preferred. The protein structure in 
the crystal is assumed to be at or near the minimum 

20 conformational free energy of the molecule for the crystal 
form. 

b) optical Activity 

25 The optical activity of polypeptides /proteins due 

to the asymmetric centers of the amino acids and to the 
asymmetric conformations thereof, can be utilized to 
determine the structure of polypeptides/ proteins. This 
asymmetry causes proteins to interact differently with 

30 right- and left-circular ly polarized light; if the two 
beams consequently travel at different speeds through the 
protein, polarized light is rotated. Optical rotatory 
dispersion ("ORD") is the dependence of this rotation upon 
wavelength. In a wavelength region where the protein 

35 molecule does not absorb light, the rotation varies 
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gradually with wavelength, but in an absorbance region , the 
rotation first increases sharply in one direction, falls to 
zero at the absorption maximum, and then rises sharply in 
the opposite direction. There will also be unequal 
5 absorption of left- and right-circularly polarized light; 
this is referred to as circular dichroisra ("CD") . Both CD 
and OES spectra of a protein are very sensitive to the 
structural conformation thereof. Folded proteins generally 
have significant optical activity in the near-UV region 
10 (250-300nm) . 

c) Nuclear Magnetic Resonance Spectroscopy 

Nuclear Magnetic Resonance Spectroscopy, using, 
15 e.g., 1 H, 13 C, 15 N, 13 P or 2 H, has proven to be of great use in 
studying protein structure in solution. Focusing on 1H, 
each hydrogen atom in a molecule has a nuclear magnetic 
spin, i.e. the nuclei of the atom act like tiny magnets. 
In the absence of an external magnetic field, the magnetic 
20 moments of the protons are randomly oriented. In a Nuclear 
Magnetic Resonance experiment, a strong external magnetic 
field is applied to the sample along a specified direction, 
resulting in a net alignment of the magnetic moments and a 
net macroscopic magnetization along the specified 
25 directional axis; a short radio-frequency pulse of 
appropriate strength is then applied, knocking the 
magnetization vector away from this axis. As the 
magnetization recovers, a transient radio-frequency signal 
is recorded as a function of time. A f ourier-transf orm of 
30 this signal then yields a frequency spectrum. Each proton 
in the molecule gives rise to a peak in this spectrum 
occurring at some characteristic resonance frequency 
determined by the local electronic environment of that 
proton. The resonance frequency of a particular proton is 
35 called its "chemical shift" and is measured as an offset 
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from some reference frequency. Structured information from 
NMR is derived from the nuclear Overhauser effect ("NOE" , 
which determines whether a pair of protons are near each 
other in space) and the coupling constants of protons that 
5 are separated by three or fewer chemical bonds. NOE and 
coupling constants provide one-dimensional data; two- 
dimensional data is provided by inter alja nuclear 
Overhauser enhancement spectroscopy (NOESY) and two- 
dimensional correlation spectroscopy (COSY); and from such 
10 data, three-dimensional protein structures can be 
determined. 

In view of the foregoing information set forth 
with respect to determination of the three-dimensional 
15 structure of protein molecules , the following claims 

directed to DNA macromolecules and amino acids inherently 
include the three dimensional structures associated with 
the protein molecules expressed thereby. 

20 The Examples herein are not to be construed as 

limited to specific vectors, plasmids and host cells which 
are preferred. The mSLO described herein is not to be 
construed as limited solely to the preferred mSLO 
designated mSLO . 3 / 6 . Similarly, the preferred mSLO .3/6 in 

25 no way constitute an admission, either actual or implied, 
that the DNA and amino acid sequences thereof are the only 
DNA and amino acid sequences to which Applicant is 
entitled. Applicant is entitled to the full breadth of 
protection tinder applicable patent laws. 

30 

For purposes of claiming materials by 
designation, JM105 transformed with plasmids comprising 
pBTac2 DNA - mSLO.3/6 and AR120 transformed with vectors 
comprising pA33 - mSLO.3/6 were deposited on August 23, 
35 1991 with the American Type Culture Collection (ATCC) , 
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12301 Parklawn Drive, Rockville, Maryland, 20852, under the 
provisions of the Budapest Treaty for the International 
Recognition of the Deposit of Microorganisms for the 
Purpose of Patent Procedure. These were tested by the ATCC 
5 on August 27, 1991, and determined to both be viable. The 
ATCC has assigned the deposit numbers ATCC 68678 and ATCC 
68676, respectively, to these materials. 

Having identified a single nucleotide (and amino- 
10 acid) difference between rSLO.3 and mSLO.3/6, those skilled 
in the art can readily prepare the DNA sequence set forth 
in Figure 1 using various methodologies known to those in 
the art, e.g., wild-type SLO subjected to the process 
described in, e.g., United States Patent No. 4,351,901 
15 ("Method for single nucleotide alteration"), which is 

incorporated herein by reference. Other methodologies for 
nucleotide and amino acid substitution, terminal and 
intermediate alterations and deletions will be readily 
apparent to those skilled in the art. Furthermore, as the 
20 DNA synthesis art progresses such that oligonucleotides 
having the length of the DNA sequence of Figure 1 can be 
rapidly obtained, one can synthesize that sequence as 
appropriate with such advances in the art. 

25 Additionally, because Applicant has discovered, 

inter alia , that the single nucleotide substitution 
identified in Figure 1 (which in turn leads to a single 
amino acid change as set forth in Figure 2) 
has reduced the hemolytic activity of the SLO variant as 

30 described, those skilled in the art can readily obtain 
fragments of that DNA sequence (e.g., via nucleotide 
deletion) such that the fragment continues to maintain at 
least one epitopic site characteristic of wild-type SLO and 
maintains non-hemolytic characteristics. Furthermore, and 

35 as noted, conservative substitutions of nucleotides can be 
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made without concomitant changes in the amino acid 
sequence, as those in the art understand and appreciate. 
For example, "computerized back translation" techniques can 
be used, whereby the amino acid sequence is analyzed by a 
5 computer and the computer determines the optimum 

nucleotides to utilize in the codons necessary to encode 
such amino acids. Additionally, DNA sequences preferably 
evidencing 80% homology with the DNA sequence of Figure 1, 
preferably 85%, and most preferably 90% homology with that 
10 sequence are considered to fall within the scope of the 
invention. 

Because the screening of SLO variants can be 
readily accomplished using the above-described blood 

15 overlay technique, numerous SLO variant candidates can be 
rapidly evaluated. Accordingly, having defined the 
specific substitution that has led to the specific SLO 
variant, those skilled in the art can readily use this 
advance in the art to derive SLO variant analog candidates, 

20 rapidly screen these candidates for indications of non- 
hemolytic activity, and determine the nucleic acid and 
amino acid sequences of analogs which fall within 
Applicant's definition of mSLO. 

25 Accordingly: while the Examples herein are 

directed to a specific SLO variant, mSLO.3/6; because, 
statistically speaking, this variant has no hemolytic 
activity compared to wild-type SLO; and because having had 
this advance in the art placed in their possession, those 

30 in the art can utilize techniques known to the art to adapt 
this advance to their own ends, Applicant's invention is 
seen to comprehend SLO variants having the characteristics 
as defined, and is not limited to the specific variant 
disclosed in the Examples. 
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Although the present invention has been described 
in considerable detail with regard to certain preferred 
embodiments thereof, other embodiments within the scope of 
the teachings of the present invention are possible. As 
5 such, while the production of a specific SLO variant has 
been described in detail, this is to be construed as an 
exemplar. Accordingly, neither the disclosure, nor the 
claims to follow, are intended, nor should be construed to 
be, limited by the descriptions of the preferred 
10 embodiments contained herein. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Adams, Craig M. 

(ii) TITLE OF INVENTION: Streptolysin O Variants 
10 (iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 



15 



20 



(A) 


ADDRESSEE: 


Beckman instruments, Inc. 


(B) 


STREET: 


2500 Harbor Blvd. 


(C) 


CITY: 


Fullerton 


(D) 


STATE: 


California 


(E) 


COUNTRY: 


USA 


(F) 


ZIP: 


92634 



(V) COMPUTER READABLE FORM: 



(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 

(B) COMPUTER: IBM 

25 (C) Operating System: MS . DOS 

(D) SOFTWARE: WordPerfect 5.1 

(vi) CURRENT APPLICATION DATA: 

30 (A) APPLICATION NUMBER: 07/752,428 

(B) FILING DATE: August 30, 1991 

(C) CLASSIFICATION: 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) REGISTRATION NUMBER: 
5 (C) FILING DATE: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Burgoon, Richard P. 
10 (B) REGISTRATION NUMBER: 34,787 

(C) REFERENCE /DOCKET NUMBER: 128D-1028 

(ix) TELECOMMUNICATION INFORMATION: 

15 (A) TELEPHONE: (714) 773-7610 

(B) TELEFAX: (714) 773-7936 

(2) INFORMATION FOR SEQ ID NO. : 1: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1524 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: genomic DNA 

(Vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Streptococcus pyogenes 
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(Vii) 



IMMEDIATE SOURCE: 



(A) 
(B) 



LIBRARY: genomic 
CLONE: mSLO.3/6 



5 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO. : I: 



ATG GAT CCG TCA GAA GAG AAA AAA AAG AGC GAA GAA GAT CAC ACT GAA 



48 



GAA ATC AAT GAC AAG ATT TAT TCA CTA AAT TAT AAT GAG CTT GAA GTA 96 

CTT GCT AAA AAT GGT GAA ACC ATT GAA AAT TTT GTT CCT AAA GAA GGC 144 

GTT AAG AAA GCT GAT AAA TTT ATT GTC ATT GAA AGA AAG AAA AAA AAT 192 

ATC AAC ACT ACA CCA GTC GAT ATT TCC ATC ATT GAC TCT GTC ACT GAT 24< 

AGG ACC TAT CCA GCA GCC CTT CAG CTG GCT AAT AAA GGT TTT ACC GAA 288 

AAC AAA CCA GAC GCG GTA GTC ACC AAG CGA AAC CCA CAA AAA ATC CAT 336 

ATT GAT TTA CCA GGT ATG GGA GAC AAA GCA ACG GTT GAG GTC AAT GAC 384 

CCT ACC TAT GCC AAT GTT TCA ACA GCT ATT GAT AAT CTT GTT AAC CAA 432 

TGG CAT GAT AAT TAT TCT GGT GGT AAT ACG CTT CCT GCC AGA ACA CAA 480 

TAT ACT AAA TCA ATG GTA TAT TCT AAG TCA CAG ATT GAA GCA GCT CTA 528 

AAT GTT AAT AGC AAA ATC TTA GAT GGT ACT TTA GGC ATT GAT TTC AAG 576 

TCG ATT TCA AAA GGT GAA AAG AAG GTG ATG ATT GCA GCA TAC AAG CAA 624 

ATT TTT XAC ACC gta TCA GCA AAC CTT CCT AAT AAT CCT GCG GAT GTG 672 

TTT GAT AAA TCA GTG ACC TTT AAA GAG TTG CAA CGA AAA GGT GTC AGC 720 

AAT GAA GCT CCG CCA CTC TTT GTG AGT AAC GTA GCC TAT GGT CGA ACT 768 

GTT TTT GTC AAA CTA GAA ACA AGT TCT AAA AGT AAT GAT GTT GAA GCG 816 

GCC TTT AGT GCA GCT CTA AAA GGA ACA GAT GTT AAA ACT AAT GGA AAA 864 
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TAG TCT GAT ATC TTA GAA AAT AGC TCA TTT ACA GCT GTC GTT TTA GGA 912 
GGA GAT GCT GCA GAG CAC AAT AAG GTA GTC ACA AAA GAC TTT GAT GTT 960 

ATT AGA AAC GTT ATC AAA GAC AAT GCT ACC TTC AGT AGA AAA AAC CCA 1008 
GCT TAT CCT ATT TCA TAC ACC AGT GTT TTC CTT AAA AAT AAT AAA ATT 1056 

GCG GGT GTC AAT AAC AGA ACT GAA TAC GTT GAA ACA ACA TCT ACC GAG 1104 

TAC ACT AGT GGA AAA ATT AAC CTG TCT CAT CAA GGC GCG TAT GTT GCT 1152 

CAA TAT GAA ATC CTT TGG GAT GAA ATC AAT TAT GAT GAC AAA GGA AAA 1200 

GAA GTG ATT ACA AAA CGA CGT TGG GAT AAC AAC TGG TAT AGT AAG ACA 1248 

TCA CCA TTT AGC ACA GTT ATC CCA CTA GGA GCT AAT TCA CGA AAT ATA 1296 

CGT ATC ATG GCT AGA GAG TGC ACC GGC TTA GCT TGG GAA TGG TGG CGA 1344 

AAA GTG ATC GAC GAA AGA GAT GTG AAA CTG TCT AAA GAA ATC AAT GTC 1392 

AAC ATC TCA GGA TCA ACC CTG AGC CCA TAT GGT TCG ATT ACT TAT AAG 1440 

TAG GAC TGG TTC AAG AGG TTC GTC AAG CAC CTT GAT GCT GCT TAT CTC 1488 

TTG AGA TCC CCG GGT AGG CCT AGT TAA CTA GTC GAC 1524 

(3) INFORMATION FOR SEQ ID NO. : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 amino acids 

(B) TYPE: Amino Acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULAR TYPE: protein 
(ix) FEATURE: 

(A) NAME/KEY: Signal sequence 
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(B) LOCATION: amino acid 98 to amino acid 
571 of SLO (except for mutation at amino 
acid 304 of following sequence) 

(C) IDENTIFICATION METHOD: experimentally 
determined based upon production of 
soluble, non-hemolytically active SLO 
from recombinant vector 

(D) OTHER INFORMATION: Does not lyse red 
blood cells 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO.: 2: 



Met Asp Pro Ser Glu Asp Lys Lys Lys Ser Glu Glu Asp His Thr Glu 
-2 -1 5 10 

Glu He Asn Asp Lys lie Tyr Ser Leu Asn Tyr Asn Glu Leu Glu Val 
15 • 20 25 30 

Leu Ala Lys Asn Gly Glu Thr lie Glu Asn Phe Val Pro Lys Glu Gly 

35 40 45 

Val Lys Lys Ala Asp Lys Phe He Val lie Glu Arg Lys Lys Lys Asn 
50 55 60 

He Asn Thr Thr Pro Val Asp He Ser lie lie Asp Ser Val Thr Asp 
65 70 75 

Arg Thr Tyr Pro Ala Ala Leu Gin Leu Ala Asn Lys Gly Phe Thr Glu 
80 85 90 

Asn Lys Pro Asp Ala Val Val Thr Lys Arg Asn Pro Gin Lys He His 
95 100 105 HO 

He Asp Leu Pro Gly Met Gly Asp Lys Ala Thr Val Glu Val Asn Asp 

115 120 125 
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Pro Thr Tyr Ala Asn Val Ser Thr Ala lie Asp Asn Leu Val Asn Gin 
130 135 140 

Trp His Asp Asn Tyr Ser Gly Gly Asn Thr Leu Pro Ala Arg Thr Gin 
145 150 155 

Tyr Thr Lys Ser Met Val Tyr Ser Lys Ser Gin lie Glu Ala Ala Leu 
160 165 170 

Asn Val Asn Ser Lys lie Leu Asp Gly Thr Leu Gly lie Asp Phe Lys 
175 180 185 190 

Ser lie Ser Lys Gly Glu Lys Lys Val Met lie Ala Ala Tyr Lys Gin 

195 200 205 

lie Phe Tyr Thr Val Ser Ala Asn Leu Pro Asn Asn Pro Ala Asp Val 
210 215 220 

Phe Asp Lys Ser Val Thr Phe Lys Glu Leu Gin Arg Lys Gly Val Ser 
225 230 235 

Asn Glu Ala Pro Pro Leu Phe Val Ser Asn Val Ala Tyr Gly Arg Thr 
240 245 250 

Val Phe Val Lys Leu Glu Thr Ser Ser Lys Ser Asn Asp Val Glu Ala 
255 260 265 270 

Ala Phe Ser Ala Ala Leu Lys Gly Thr Asp Val Lys Thr Asn Gly Lys 

275 280 285 

Tyr Ser Asp lie Leu Glu Asn Ser Ser Phe Thr Ala Val Val Leu Gly 
290 295 300 



SUBSTITUTE SHEET 



WO 93/05,55 PCT/US92/06380 

- 64 - 

Gly Asp Ala Ala Glu His Asn Lys Val Val Thr Lys Asp Phe Asp Val 
305 310 315 

lie Arg Asn Val lie Lys Asp Asn Ala Thr Phe Ser Arg Lys Asn Pro 
320 325 330 

Ala Tyr Pro lie Ser Tyr Thr Ser Val Phe Leu Lys Asn Asn Lys He 
335 340 345 350 

Ala Gly Val Asn Asn Arg Thr Glu Tyr Val Glu Thr Thr Ser Thr Glu 

355 3 60 365 

Tyr Thr Ser Gly Lys He Asn Leu Ser His Gin Gly Ala Tyr Val Ala 
370 375 380 

Gin Tyr Glu lie Leu Trp Asp Glu lie Asn Tyr Asp Asp Lys Gly Lys 
385 390 395 

Glu Val He Thr Lys Arg Arg Trp Asp Asn Asn Trp Tyr Ser Lys Thr 
4 °0 405 410 

Ser Pro Phe Ser Thr Val lie Pro Leu Gly Ala Asn Ser Arg Asn lie 
415 420 425 430 

Arg He Met Ala Arg Glu Cys Thr Gly Leu Ala Trp Glu Trp Trp Arg 

435 440 445 

Lys Val He Asp Glu Arg Asp Val Lys Leu Ser Lys Glu He Asn Va l 
450 455 460 

Asn He Ser Gly Ser Thr Leu Ser Pro Tyr Gly Ser He Thr Tyr Lys 
465 470 475 
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What is claimed is: 



1. A purified and isolated DNA sequence consisting 
essentially of a DNA sequence capable of expressing 
a soluble variant of Streptolysin O evidencing 
substantially no hemolytic activity. 

2. The DNA sequence of claim 1 wherein the variant 
of Streptolysin 0 is characterized by evidencing 
less than about 1.5% of the hemolytic activity of 
wild-type Streptolysin 0, said wild-type 
Streptolysin 0 having a hemolytic activity of about 

4 x 10 5 hemolytic units per milligram. 

3. The DNA sequence of claim l wherein the variant 
of Streptolysin 0 is characterized by evidencing 
less than about 0.5% of the hemolytic activity of 
wild-type Streptolysin 0, said wild-type 
Streptolysin 0 having a hemolytic activity of about 

4 x 10 s hemolytic units per milligram. 

4. The purified and isolated DNA sequence of claim 
1 wherein the variant of Streptolysin o is 
characterized by evidencing less than about 0.1% of 
the hemolytic activity of wild-type Streptolysin O, 
said wild-type Streptolysin O having a hemolytic 
activity of about 4 x 10 s hemolytic units per 
milligram. 

5. A DNA sequence encoding a variant version of 
Streptolysin 0 comprising a DNA sequence that is at 
least 80% homologous with the DNA sequence as set 
forth in Figure 1 from nucleotide 7 through 
nucleotide 1524, inclusive. 
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6. A DNA sequence encoding a variant version of 
Streptolysin O comprising the DNA sequence as set 
forth in Figure X from nucleotide 7 through 
nucleotide 1524, inclusive. 

7. A prokaryotic or eukaryotic host cell 
transformed or transfected with a DNA sequence 
according to claim 1, 2 , 3, 4, 5, or 6 in a manner 
allowing the host cell to express a varaint version 
of Streptolysin 0. 

8. A DNA plasmid comprising a DNA vector and a DNA 
sequence according to claim 1, 2, 3, 4, 5, or 6. 

9. A prokaryotic or eukaryotic host cell 
transformed or transfected with a DNA plasmid 
according to claim 8. 

10. The DNA plasmid of claim 8 wherein said DNA 
vector further comprises a promoter. 

11. The DNA plasmid of claim 8 wherein said DNA 
vector further comprises a regulated promoter. 

12. The DNA plasmid of claim 8 wherein said DNA 
vector further comprises a regulated promoter 
selected from the group consisting of p L , p^, and 

PT7- 

13* A purified and isolated DNA sequence consisting 
essentially of a DNA sequence encoding a polypeptide 
having an amino acid sequence of that of a variant 
of Streptolysin O, said variant of Streptolysin 0 
characterized by being soluble upon expression and 
evidencing substantially no hemolytic activity. 
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14. The purified and isolated DNA sequence of claim 
13 wherein said variant of Streptolysin 0 is 
characterized by evidencing less than about 1.5% of 
the hemolytic activity of wild-type Streptolysin 0, 
said wild-type Streptolysin 0 having a hemolytic 
activity of about 4 x 10 s hemolytic units per 
milligram. 

15. The purified and isolated DNA sequence of claim 
13 wherein said variant of Streptolysin O is 
characterized by evidencing less than about 0.5% of 
the hemolytic activity of wild-type Streptolysin o, 
said wild-type Streptolysin O having a hemolytic 
activity of about 4 x 10 5 hemolytic units per 
milligram. 

16. The purified and isolated DNA sequence of claim 
13 wherein said variant of Streptolysin O is 
characterized by evidencing less than about 0.1% of 
the hemolytic activity of wild-type Streptolysin o, 
said wild-type Streptolysin 0 having a hemolytic 
activity of about 4 x 10 5 hemolytic units per 
milligram. 

17. An amino acid sequence encoding a variant of 
Streptolysin O comprising amino acids 1 through 473, 
inclusive, of the amino acid sequence set forth in 
Figure 2. 

18. A cDNA sequence according to claim 13. 

19. A prokaryotic or eukaryotic host cell 
transformed or transfected with a DNA sequence 
according to claim 13, 14, 15, 16, 17 or 18 in a 
manner allowing the host cell to express a variant 
of Streptolysin O. 
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20. A DNA plasmid comprising a DNA vector and a 
DNA sequence according to claim 13, 14, 15, 16, 17, 
or 18. 

5 21. A prokaryotic or eukaryotic host cell 

transformed or trans feet ed with a DNA plasmid 
according to claim 20. 

22. The DNA plasmid of claim 21 wherein said DNA 
10 vector further comprises a promoter. 

23 . The DNA plasmid of claim 21 wherein said DNA 
vector further comprises a regulated promoter. 

15 24. The DNA plasmid of claim 21 wherein said DNA 

vector further comprises a regulated promoter 
selected from the group consisiting of p L , p^, and 

PT7- 



25. A purified variant version of Streptolysin O 
characterized by comprising at least one epitope 
characteristic of wild-type Streptolysin O, being 
soluble upon expression and evidencing less than 
about 1.5% of the hemolytic activity of wild-type 
Streptolysin O having a hemolytic activity of about 
4 x 10* hemolytic units per milligram of said wild 
type Streptolysin O. 

26. The variant version of Streptolysin O of claim 
25 evidencing less than about 0.5% of the hemolytic 
activity of wild-type Streptolysin O, said wild- type 
Streptolysin O having a hemolytic activity of about 
4 x 10 5 hemolytic units per milligram. 

27. The variant version of Streptolysin 0 of claim 
25 evidencing less than about 0.1% of the hemolytic 
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activity of wild-type Streptolysin O, said wild-type 
SLO having a hemolytic activity of about 4 x 10 5 
hemolytic units per milligram. 

5 28. A purified variant version of Streptolysin 0 

having a specific hemolytic activity of less than 
about 400HU/mg and being soluble upon expression. 

29. A process for obtaining a purified and 
10 substantially non-hemolytically active variant of 

Streptolysin 0 which is soluble upon expression 
comprising the steps of: 

(a) introducing a fragment of the Streptolysin O 
gene in translational fused or unfused form soluble, 

15 said gene being caspanle of producing soluble, 

hemolytically active Streptolysin O, into a host 
capable of producing high levels of mutagenic 
activity, to obtain a mutation fragment candidate; 

(b) introducing said mutation fragment candidate 
20 into an appropriate vector to obtain a plasmid; 

(c) transforming a host capable of expressing the 
mutation fragment candidate with the plasmid to 
obtain a mutation protein candidate; and 

(d) screening said mutation protein candidate for an 
25 absence of hemolysis of red blood cells. 



SUBSTITUTE SHEET 



WO 93/05155 



PCT/US92/06380 



1/8 



ATG GAT CCG TCA GAA GAC AAA AAA AAG AGC GAA GAA GAT CAC ACT GAA 48 

GAA ATC AAT GAC AAG ATT TAT TCA CTA AAT TAT AAT GAG CTT GAA GTA 96 

CTT GCT AAA AAT GGT GAA ACC ATT GAA AAT TTT GTT CCT AAA GAA GGC 144 

GTT AAG AAA GCT GAT AAA TTT ATT GTC ATT GAA AGA AAG AAA AAA AAT 192 

ATC AAC ACT ACA CCA GTC GAT ATT TCC ATC ATT GAC TCT GTC ACT GAT 240 

AGG ACC TAT CCA GCA GCC CTT CAG CTG GCT AAT AAA GGT TTT ACC GAA 288 

AAC AAA CCA GAC GCG GTA GTC ACC AAG CGA AAC CCA CAA AAA ATC CAT 336 

ATT GAT TTA CCA GGT ATG GGA GAC AAA GCA ACG GTT GAG GTC AAT GAC 384 

CCT ACC TAT GCC AAT GTT TCA ACA GCT ATT GAT AAT CTT GTT AAC CAA 432 

TGG CAT GAT AAT TAT TCT GGT GGT AAT ACG CTT CCT GCC AGA ACA CAA 480 

TAT ACT AAA TCA ATG GTA TAT TCT AAG TCA CAG ATT GAA GCA GCT CTA 528 

AAT GTT AAT AGC AAA ATC TTA GAT GGT ACT TTA GGC ATT GAT TTC AAG 576 

TCG ATT TCA AAA GGT GAA AAG AAG GTG ATG ATT GCA GCA TAC AAG CAA 624 

ATT TTT TAC ACC GTA TCA GCA AAC CTT CCT AAT AAT CCT GCG GAT GTG 672 

TTT GAT AAA TCA GTG ACC TTT AAA GAG TTG CAA CGA AAA GGT GTC AGC 720 

AAT GAA GCT CCG CCA CTC TTT GTG AGT AAC GTA GCC TAT GGT CGA ACT 768 

GTT TTT GTC AAA CTA GAA ACA AGT TCT AAA AGT AAT GAT GTT GAA GCG 816 

GCC TTT AGT GCA GCT CTA AAA GGA ACA GAT GTT AAA ACT AAT GGA AAA 864 

TAC TCT GAT ATC TTA GAA AAT AGC TCA TTT ACA GCT GTC GTT TTA GGA 912 

GGA GAT GCT GCA GAG CAC AAT AAG GTA GTC ACA AAA GAC TTT GAT GTT 960 

ATT AGA AAC GTT ATC AAA GAC AAT GCT ACC TTC AGT AGA AAA AAC CCA 1008 

GCT TAT CCT ATT TCA TAC ACC AGT GTT TTC CTT AAA AAT AAT AAA ATT 1056 

GCG GGT GTC AAT AAC AGA ACT GAA TAC GTT GAA ACA ACA TCT ACC GAG 1104 

TAC ACT AGT GGA AAA ATT AAC CTG TCT CAT CAA GGC GCG TAT GTT GCT 1152 

CAA TAT GAA ATC CTT TGG GAT GAA ATC AAT TAT GAT GAC AAA GGA AAA 1200 

GAA GTG ATT ACA AAA CGA CGT TGG GAT AAC AAC TGG TAT AGT AAG ACA 1248 

TCA CCA TTT AGC ACA GTT ATC CCA CTA GGA GCT AAT TCA CGA AAT ATA 1296 

CGT ATC ATG GCT AGA GAG TGC ACC GGC TTA GCT TGG GAA TGG TGG CGA 1344 

AAA GTG ATC GAC GAA AGA GAT GTG AAA CTG TCT AAA GAA ATC AAT GTC 1392 

AAC ATC TCA GGA TCA ACC CTG AGC CCA TAT GGT TCG ATT ACT TAT AAG 1440 

TAG GAC TGG TTC AAG AGG TTC GTC AAG CAC CTT GAT GCT GCT TAT CTC 1488 

TTG AGA TCC CCG GGT AGG CCT AGT TAA CTA GTC GAC 1524 

FIG. 1 
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Met Asp Pro Ser Glu Asp Lys Lys Lys Ser Glu Glu Asp His Thr Glu 
-2-1 5 10 

Glu He Asn Asp Lys He Tyr Ser Leu Asn Tyr Asn Glu Leu Glu Val 
15 20 25 30 

Leu Ala Lys Asn Gly Glu Thr He Glu Asn Phe Val Pro Lys Glu Gly 

35 40 45 

Val Lys Lys Ala Asp Lys Phe He Val He Glu Arg Lys Lys Lys Asn 

50 55 60 

He Asn Thr Thr Pro Val Asp lie Ser He He Asp Ser Val Thr Asp 

65 70 75 

Arg Thr Tyr Pro Ala Ala Leu Gin Leu Ala Asn Lys Gly Phe Thr Glu 

80 85 90 

Asn Lys Pro Asp Ala Val Val Thr Lys Arg Asn Pro Gin. Lys He His 
95 100 105 110 

He Asp Leu Pro Gly Met Gly Asp Lys Ala Thr Val Glu Val Asn Asp 

115 120 125 

Pro Thr Tyr Ala Asn Val Ser Thr Ala He Asp Asn Leu Val Asn Gin 

130 135 140 

Trp His Asp Asn Tyr Ser Gly Gly Asn Thr Leu Pro Ala Arg Thr Gin 

145 150 155 

Tyr Thr Lys Ser Met Val Tyr Ser Lys Ser Gin He Glu Ala Ala Leu 

160 165 " 170 

Asn Val Asn Ser Lys He Leu Asp Gly Thr Leu Gly He Asp Phe Lys 
175 180 185 190 

Ser He Ser Lys Gly Glu Lys Lys Val Met He Ala Ala Tyr Lys Gin 

195 " 200 205 

He Phe Tyr Thr Val Ser Ala Asn Leu Pro Asn Asn Pro Ala Asp Val 

210 215 220 

Phe Asp Lys Ser Val Thr Phe Lys Glu Leu Gin Arg Lys Gly Val Ser 

225 230 235 

Asn Glu Ala Pro Pro Leu Phe Val Ser Asn Val Ala Tyr Gly Arg Thr 

240 245 250 

Val Phe Val Lys Leu Glu Thr Ser Ser Lys Ser Asn Asp Val Glu Ala 
255 260 265 270 

Ala Phe Ser Ala Ala Leu Lys Gly Thr Asp Val Lys Thr Asn Gly Lys 

275 280 285 

Tyr Ser Asp He Leu Glu Asn Ser Ser Phe Thr Ala Val Val Leu Gly 

290 295 300 

Gly Asp Ala Ala Glu His Asn Lys Val Val Thr Glu Asp Phe Asp Val 

305 310 315 

He Arg Asn Val He Lys Asp Asn Ala Thr Phe Ser Arg Lys Asn Pro 

320 325 330 

Ala Tyr Pro He Ser Tyr Thr Ser Val Phe Leu Lys Asn Asn Lys He 
335 340 345 350 

Ala Gly Val Asn Asn Arg Thr Glu Tyr Val Glu Thr Thr Ser Thr Glu 

355 360 365 

Tyr Thr Ser Gly Lys He Asn Leu Ser His Gin Gly Ala Tyr Val Ala 

370 375 380 

Gin Tyr Glu He Leu Trp Asp Glu He Asn Tyr Asp Asp Lys Gly Lys 

385 390 395 

Glu Val He Thr Lys Arg Arg Trp Asp Asn Asn Trp Tyr Ser Lys Thr 

400 405 410 

Ser Pro Phe Ser Thr Val He Pro Leu Gly Ala Asn Ser Arg Asn He 
415 420 425 430 

Arg He Met Ala Arg Glu Cys Thr Gly Leu Ala Trp Glu Trp Trp Arg 

435 440 445 

Lys Val He Asp Glu Arg Asp Val Lys Leu Ser Lys Glu He Asn Val 

450 455 460 

Asn He Ser Gly Ser Thr Leu Ser Pro Tyr Gly Ser lie Thr Tyr Lys 
465 470 475 



FIG. 2 
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ATG GAT CCG TCA GAA GAC AAA AAA AAG AGC GAA GAA GAT CAC ACT GAA 48 

GAA ATC AAT GAC AAG ATT TAT TCA CTA AAT TAT AAT GAG CTT GAA GTA 96 

CTT GCT AAA AAT GGT GAA ACC ATT GAA AAT TTT GTT CCT AAA GAA GGC 144 

GTT AAG AAA GCT GAT AAA TTT ATT GTC ATT GAA AGA AAG AAA AAA AAT 192 

ATC AAC ACT ACA CCA GTC GAT ATT TCC ATC ATT GAC TCT GTC ACT GAT 240 

AGG ACC TAT CCA GCA GCC CTT CAG CTG GCT AAT AAA GGT TTT ACC GAA 288 

AAC AAA CCA GAC GCG GTA GTC ACC AAG CGA AAC CCA CAA AAA ATC CAT 336 

ATT GAT TTA CCA GGT ATG GGA GAC AAA GCA ACG GTT GAG GTC AAT GAC 384 

CCT ACC TAT GCC AAT GTT TCA ACA GCT ATT GAT AAT CTT GTT AAC CAA 432 

TGG CAT GAT AAT TAT TCT GGT GGT AAT ACG CTT CCT GCC AGA ACA CAA 480 

TAT ACT GAA TCA ATG GTA TAT TCT AAG TCA CAG ATT GAA GCA GCT CTA 528 

AAT GTT AAT AGC AAA ATC TTA GAT GGT ACT TTA GGC ATT GAT TTC AAG 576 

TCG ATT TCA AAA GGT GAA AAG AAG GTG ATG ATT GCA GCA TAC AAG CAA 624 

ATT TTT TAC ACC GTA TCA GCA AAC CTT CCT AAT AAT CCT GCG GAT GTG 672 

TTT GAT AAA TCA GTG ACC TTT AAA GAG TTG CAA CGA AAA GGT GTC AGC 720 

AAT GAA GCT CCG CCA CTC TTT GTG AGT AAC GTA GCC TAT GGT CGA ACT 768 

GTT TTT GTC AAA CTA GAA ACA AGT TCT AAA AGT AAT GAT GTT GAA GCG 816 

GCC TTT AGT GCA GCT CTA AAA GGA ACA GAT GTT AAA ACT AAT GGA AAA 864 

TAC TCT GAT ATC TTA GAA AAT AGC TCA TTT ACA GCT GTC GTT TTA GGA 912 

GGA GAT GCT GCA GAG CAC AAT AAG GTA GTC ACA AAA GAC TTT GAT GTT 960 

ATT AGA AAC GTT ATC AAA GAC AAT GCT ACC TTC AGT AGA AAA AAC CCA 1008 

GCT TAT CCT ATT TCA TAC ACC AGT GTT TTC CTT AAA AAT AAT AAA ATT 1056 

GCG GGT GTC AAT AAC AGA ACT GAA TAC GTT GAA ACA ACA TCT ACC GAG 1104 

TAC ACT AGT GGA AAA ATT AAC CTG TCT CAT CAA GGC GCG TAT GTT GCT 1152 

CAA TAT GAA ATC CTT TGG GAT GAA ATC AAT TAT GAT GAC AAA GGA AAA 1200 

GAA GTG ATT ACA AAA CGA CGT TGG GAT AAC AAC TGG TAT AGT AAG ACA 1248 

TCA CCA TTT AGC ACA GTT ATC CCA CTA GGA GCT AAT TCA CGA AAT ATA 1296 

CGT ATC ATG GCT AGA GAG TGC ACC GGC TTA GCT TGG GAA TGG TGG CGA 1344 

AAA GTG ATC GAC GAA AGA GAT GTG AAA CTG TCT AAA GAA ATC AAT GTC 1392 

AAC ATC TCA GGA TCA ACC CTG AGC CCA TAT GGT TCG ATT ACT TAT AAG 1440 

TAG GAC TGG TTC AAG AGG TTC GTC AAG CAC CTT GAT GCT GCT TAT CTC 1488 

TTG AGA TCC CCG GGT AGG CCT AGT TAA CTA GTC GAC 1524 
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Met Asp Pro Ser Glu Asp Lys Lys Lys Ser Glu Glu Asp His Thr Glu 
-2 -1 5 10 

Glu He Asn Asp Lys lie Tyr Ser Leu Asn Tyr Asn Glu Leu Glu Val 
15 20 25 30 

Leu Ala Lys Asn Gly Glu Thr He Glu Asn Phe Val Pro Lys Glu Gly 

35 40 45 

Val Lys Lys Ala Asp Lys Phe He Val lie Glu Arg Lys Lys Lys Asn 

50 55 60 

He Asn Thr Thr Pro Val Asp He Ser He He Asp Ser Val Thr Asp 

65 70 75 

Arg Thr Tyr Pro Ala Ala Leu Gin Leu Ala Asn Lys Gly Phe Thr Glu 

80 85 90 

Asn Lys Pro Asp Ala Val Val Thr Lys Arg Asn Pro Gin Lys lie His 
95 100 105 110 

He Asp Leu Pro Gly Met Gly Asp Lys Ala Thr Val Glu Val Asn Asp 

115 120 125 

Pro Thr Tyr Ala Asn Val Ser Thr Ala He Asp Asn Leu Val Asn Gin 

130 135 140 

Trp His Asp Asn Tyr Ser Gly Gly Asn Thr Leu Pro Ala Arg Thr Gin 

145 150 155 

Tyr Thr Glu Ser Met Val Tyr Ser Lys Ser Gin He Glu Ala Ala Leu 

160 165 170 

Asn Val Asn Ser Lys He Leu Asp Gly Thr Leu Gly He Asp Phe Lys 
175 180 185 190 

Ser He Ser Lys Gly Glu Lys Lys Val Met He Ala Ala Tyr Lys Gin 

195 200 205 

He Phe Tyr Thr Val Ser Ala Asn Leu Pro Asn Asn Pro Ala Asp Val 

210 215 220 

Phe Asp Lys Ser Val Thr Phe Lys Glu Leu Gin Arg Lys Gly Val Ser 

225 230 235 

Asn Glu Ala Pro Pro Leu Phe Val Ser Asn Val Ala Tyr Gly Arg Thr 

240 245 250 

Val Phe Val Lys Leu Glu Thr Ser Ser Lys Ser Asn Asp Val Glu Ala 
255 260 265 270 

Ala Phe Ser Ala Ala Leu Lys Gly Thr Asp Val Lys Thr Asn Gly Lys 

275 280 285 

Tyr Ser Asp He Leu Glu Asn Ser Ser Phe Thr Ala Val Val Leu Gly 

290 295 300 

Gly Asp Ala Ala Glu His Asn Lys Val Val Thr Glu Asp Phe Asp Val 

305 310 315 

He Arg Asn Val lie Lys Asp Asn Ala Thr Phe Ser Arg Lys Asn Pro 

320 325 330 

Ala Tyr Pro He Ser Tyr Thr Ser Val Phe Leu Lys Asn Asn Lys He 
335 340 345 350 

Ala Gly Val Asn Asn Arg Thr Glu Tyr Val Glu Thr Thr Ser Thr Glu 

355 360 365 

Tyr Thr Ser Gly Lys He Asn Leu Ser His Gin Gly Ala Tyr Val Ala 

370 375 380 

Gin Tyr Glu He Leu Trp Asp Glu He Asn Tyr Asp Asp Lys Gly Lys 

385 390 395 

Glu Val He Thr Lys Arg Arg Trp Asp Asn Asn Trp Tyr Ser Lys Thr 

400 405 410 

Ser Pro Phe Ser Thr Val He Pro Leu Gly Ala Asn Ser Arg Asn He 
415 420 425 430 

Arg He Met Ala Arg Glu Cys Thr Gly Leu Ala Trp Glu Trp Trp Arg 

435 440 445 

Lys Val He Asp Glu Arg Asp Val Lys Leu Ser Lys Glu He Asn Val 

450 " 455 460 

Asn He Ser Gly Ser Thr Leu Ser Pro Tyr Gly Ser He Thr Tyr Lys 
465 470 475 
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